Abstract


In many firms the pricing and inventory control functions are separated: the marketing de- 
partment determines optimal prices first and then logistics decides on optimal stocking quan- 
tities, taking demand as exogenous and only considering incremental costs. However, a 
number of theoretical models suggest a joint determination of inventory levels and prices, as 
prices also affect stocking risks. In this work, we address the problem of simultaneously de- 
termining a pricing and inventory replenishment strategy under reference price effects. This 
reference price effect models the empirically well established fact that consumers not only 
react sensitively to the current price, but also to deviations from a reference price formed on 
the basis of past purchases. The current price is then perceived as a discount or surcharge 
relative to this reference price. Thus, immediate effects of price reductions on profits have 
to be weighed against the resulting losses in future periods. We study how the additional
dynamics of the consumers’ willingness to pay affect an optimal pricing and inventory con- 
trol model and whether a simple policy such as a base-stock-list-price policy holds in such a 
setting. 


For a one-period planning horizon we analytically prove the optimality of a base-stock- 
list-price policy with respect to the reference price under general conditions. We then extend 
this result to the two-period time horizon for the linear and loss-neutral demand function 
and to the multi-period case under even more restrictive assumptions. However, numerical 
simulations suggest that a base-stock-list-price policy is also optimal for the multi-period 
setting under more general conditions. We furthermore show by numerical investigations that 
the presence of reference price effects decreases the incentive for price discounts to deal with 
overstocked situations. Moreover, we find that the potential benefits from simultaneously 
determining optimal prices and stocking quantities compared to a sequential procedure can 
increase considerably, when reference price effects are included in the model. This makes an 
integration of pricing and inventory control with reference price effects by all means worth 
the effort. 
1. Introduction


1.1. Problem description 


Recent years have witnessed increased interest on the part of retail and manufacturing com- 
panies in investigating innovative pricing strategies in order to boost their operations and 
bottom line. In the past, grocery, drug or fashion apparel stores, for example, would fix a product’s
price over a relatively long time period and mainly focus on their inventory management in 
order to obtain a better match between supply and demand. This static pricing strategy was
mainly due to a lack of information about customers’ tastes and willingness to pay, and to
the fact that high transaction costs - so-called menu costs - were associated with changing
prices. Driven in large part by advances in information technology and e-commerce, a more 
sophisticated approach of changing a product’s price found its way into retail and manufac- 
turing industries. Here, the seller changes prices dynamically over time, based on factors like 
demand information, supply availability, production schedules and the time of sale. With the 
goal of balancing demand and supply, dynamic pricing methods were first applied by indus- 
tries where the short term capacity is hard to change, such as airlines, hotels, cruise ships, etc. 
(see Talluri and van Ryzin (2004) for more detail). Nowadays, the business model of dynam- 
ically changing the prices of a product is an important revolution in retail and manufacturing 
industries and is already widely practiced, e.g. by Dell Computers and Amazon. There is
growing understanding that both pricing as well as replenishment decisions are essential for 
increasing a firm’s profitability and thus should be coordinated. Nevertheless they are tra- 
ditionally mostly determined by separate functional areas of a company’s organization: the 
marketing department sets prices, the market determines the quantity demanded, and the lo- 
gistics unit produces the quantity demanded. However, research work such as Whitin (1955) 
has already shown that the simultaneous determination of price and ordering or production 
quantity can yield substantial revenue increases. The coordination of price decisions and 
other aspects of the supply chain such as production and distribution is thus not only useful, 
but also essential. Coordinating these decisions means optimizing the system rather than its 
individual elements and not only potentially increases profits but also reduces variability in 
demand or production, resulting in more efficient supply chains. Enabled by powerful IT 
systems that can store and estimate thousands of demand models and compute integrated 
optimal policies today, reengineering efforts are being initiated in many companies to elimi- 
nate the organizational barriers between distinct functional areas within the same enterprise 
by creating new entities with such designations as 'Revenue Management', 'Dynamic Pricing'
or 'Smart Pricing'.
1.2. Research intention 


Looking at the state-of-the-art methodological literature, we find that relevant work divides 
into two rather distinct streams: The operations oriented stream (see chapter 2) and the 
marketing oriented stream (see chapter 3). Eliashberg and Steinberg (1993) give a nice com- 
parison of the two streams: Operations management (or production management) deals with 
organizing and controlling the direct resources to produce the goods and services provided 
by an organization to customers. Marketing in contrast deals with the process of planning 
and executing pricing, promotion and distribution of goods and services in order to create 
exchanges that satisfy individual and organizational objectives. The interface between mar- 
keting and operations management is being recognized as a legitimate research domain and 
has experienced increased emphasis in the past. Nevertheless, as already stated above, in 
most firms the marketing and production functions are organizationally separate. A possible 
explanation could be that marketing is typically concerned with revenue maximization by 
setting prices and advertising policies. Here, relatively realistic demand models are being 
used, which for example account for intertemporal demand correlations by incorporating 
both current price and reference price, which is formed on the basis of past purchases. However,
they rest on a rather simplistic cost structure which does not account for supply chain
management interactions by e.g. assuming stationary variable costs. Operations manage- 
ment is typically concerned with cost minimization, meaning that production is required to 
produce the needed output at minimum costs. Thus rich cost models, well describing a firm's 
possible cost structure, are being used. Costs are assumed to be non-stationary, which means 
that they can vary over time and fixed costs can be included in the model. Furthermore,
production decisions are integrated in the model (not only pricing but also inventory deci- 
sions), which is not the case in purely marketing-orientated work. The limitation of these 
models is that they rely on rather simplistic demand assumptions. Demand is, for example, 
modeled as a function of the current price only. In any case, both prevalent research streams 
consider only a partial picture of the relevant system. Typically, a coordinated decision- 
making problem results in better performance of the system. The magnitude of the improve- 
ment depends on how the objective functions are defined for the two separate departments 
and which department is assumed to act first. 


Identifying this prevailing research gap leads us to address the problem of simultaneously 
determining a pricing and inventory replenishment strategy by combining these two literature 
streams described above: we want to take the rich and non-stationary cost models commonly 
used in operations research and combine them with demand models, which account for in- 
tertemporal demand correlation and so far have been mainly applied by marketing. Both 
price and ordering quantity are to be dynamically adjusted according to the prevailing in- 
ventory, the consumers' willingness to pay and the remaining length of the finite selling 
horizon. The integration of reference price effects with inventory control models has not 
been reported so far in literature. Hence, by developing such an integrated inventory control 
and pricing model, we will probe into the issue of whether using a reference price model to
describe demand will significantly increase the benefits of integrating pricing and logistics
decisions and when it makes sense to apply such models. In this thesis we generally focus on 
linear demand models, which are detrended and seasonally adjusted. Furthermore, we only 
consider monopolistic pricing, ensuring mathematical tractability. These assumptions are 
not unrealistic, because price optimization by a firm is only possible in imperfect markets. 
In the case of monopolistic competition a firm faces a range of prices where competitors do 
not react. The linear demand function is a local approximation conditional on competitors'
prices which remain unchanged if the price stays within this permissible range (see Phillips 
(2005), Chapter 1). Not only are these models important in retail, where price-dependent 
demand plays a significant role, but also in manufacturing environments with a different un- 
derlying cost structure, in which production and distribution decisions can be complemented 
with pricing strategies in order to improve the firm's bottom line. Within this work we are 
going to examine how the additional dynamics affect an optimal policy and whether variants 
of a simple policy such as a base-stock-list-price policy still hold in such a setting. Further- 
more, we are going to find conditions under which it is possible to show analytically the 
existence of a unique optimal solution. We want to state here that the main focus of this dis- 
sertation is a mathematical analysis, which justifies that most problem definitions are taken 
from literature. However, we will still try to motivate an economic understanding of dynamic 
market models and supply chain decisions, wherever possible. Via numerical simulation we 
shall explore the size of potential benefits of such models, as well as how optimal policies 
evolve over time and how optimal solutions vary with changes in the model parameters. 


1.3. Structure of the thesis 


We will here give a short outline of the structure of this thesis. Chapter 2 and chapter 3 are 
devoted to a brief review of the current state-of-the-art literature, relevant to this work, as 
well as some minor new results. The main new results will be presented in chapters 4 to 6. 


Chapter 2 gives an overview of the models used in operations research so far. For didactical
reasons we first introduce one-period models in section 2.2, which are then expanded to the
multi-period setting in section 2.3. Within each of the two sections, we first focus on pure
inventory control models, which are then extended to joint pricing and inventory control
models. We not only present the well known critical fractile solution for the classical lost-
sales version of the newsvendor problem, but also adapt the solution to the backlogging case 
including inventory holding and backlogging costs. Furthermore, the base-stock-list-price 
policy is introduced in chapter 2 and shown to be optimal for the most commonly used de- 
mand models. We also provide a steady-state solution for the joint pricing and inventory 
control model in subsection 2.3.3, which has not been seen in literature so far. 


Chapter 3 is devoted to marketing models that mainly focus on price optimization. The 
concept of reference price effects is introduced and structural properties of the optimal solu- 
tions are given for loss-neutral and loss-averse customer behavior. We show by a numerical 
example that for loss-seeking customer behavior, the optimal solution does not converge and
thus a cyclic pricing policy is optimal. As in chapter 2, we provide a steady-state solution
for the case of non-zero proportional ordering costs, which is an extension to the solution 
found by Popescu and Wu (2007). 


In chapter 4 we combine the two models presented in chapter 2 and chapter 3 and introduce 
an integrated model including reference price effects, which will lay the foundation for the 
rest of this work. 


Chapter 5 is dedicated to an analytical investigation of the model introduced in chapter 4. This
chapter consists of three parts: the one-period case, the two-period case and the multi-period 
case. For the one-period case in section 5.1, we can prove the optimality of a base-stock-list- 
price-policy and provide implicit solutions for the optimal price and stocking quantity with 
respect to reference price under very general conditions. However, it is not so easy to extend 
this property to a multi-period setting. By integrating the solution of the one-period case into 
section 5.2, we find that for the linear demand a base-stock-list-price policy also holds for 
the two-period case. The mathematics behind this result is extensive and tedious, which is
why we chose to present the purely technical results in appendix A. In section 5.3 we
prove the optimality of a base-stock policy under rather restrictive assumptions. Adjusting 
the proof technique to a more general setting is definitely worth considering for further
research. 


Chapter 6 is devoted to simulations and numerical investigations. By means of numerical
optimization, in section 6.1 we extend the results from section 5.2 to the multi-period
setting for the special case of linear demand and loss-neutral customer behavior. We furthermore
investigate the influence of different demand distributions and coefficients of variation.
In section 6.2, we study the potential increase of profit by simultaneously determining opti- 
mal prices and stocking quantities compared to a sequential optimization, where prices are 
set first by the marketing department of a company and then the production unit decides on 
the optimal stocking quantity, without being able to change prices. In section 6.3, we provide 
some numerical results for loss-averse and loss-seeking customer behavior and the case of 
non-zero fixed ordering costs. 


The last and concluding chapter 7 of this thesis provides an overview of conclusions and 
recommendations for further research. 
2. Models in Operations Research Literature


Operations research has had a significant impact on inventory management in recent decades.
The theory of inventory management deals with the management of stock levels of goods, 
with the intent of effectively meeting demands for those goods. Traditional inventory models 
(see sections 2.2.1 and 2.3.2) assume that a commodity’s price is exogenously determined 
and thus only address the two fundamental issues: when should a replenishment order be 
placed, and what quantity should be ordered. Hence the objective is to minimize costs. Re- 
cent developments in the area of revenue management have demonstrated that major benefits 
can be derived by complementing a replenishment strategy with the dynamic adjustment of 
the commodity's price (see sections 2.2.2 and 2.3.3). Since demand for a product varies
as a function of price in practice (see e.g. Phillips (2005)), the objective therefore changes 
from minimizing costs to maximizing profits under dynamic pricing strategies. In the pres- 
ence of demand uncertainty, a common approach for risk neutral companies is to minimize 
expected costs or maximize expected profits. Alternative risk averse approaches using e.g. 
Value at Risk measures instead of expected values can be found in literature, but are not the 
focus of this thesis. The complexity of the model depends on the assumptions one makes
about demand and the underlying cost structure.


According to Porteus (1990) and Lee and Nahmias (1993), there are several reasons for 
holding inventories: The key motive is definitely to hedge against uncertainty in the face 
of stochastic demand. Holding stocks in response to this unpredictable variability means 
higher holding costs but lower shortage costs, which are in general significantly higher than 
holding costs. Moreover, economies of scale are an important reason for keeping inventories.
Economies of scale occur when there is a fixed setup cost (e.g. setup time, changeover time, 
etc.) for each order that does not depend on the lot size and often arises when there are quan- 
tity discounts or learning. Last but not least, it may be advantageous to retain inventories in 
anticipation of a price rise. Inventories may also be stockpiled in advance of sales increases. 
If demand is expected to rise, it may be more economical to build up large inventories in advance,
rather than to increase production capacity at a future time. However, large build-ups
of inventory are often a result of poor sales. 


For a good overview and recent reviews on inventory models, we refer the reader to Por- 
teus (1990), Lee and Nahmias (1993), Zipkin (2000), Eliashberg and Steinberg (1993), El- 
maghraby and Keskinocak (2003), Chan et al. (2004) and Simchi-Levi et al. (2005). 
2.1. Problem description


Consider a retailer or manufacturer who maintains an inventory of a particular product. Since 
customer demand is random, the decision maker only has a vague idea about the actual 
demand occurring at a given time. This information is described by a probability distribution 
of demand. Depending on this knowledge, the retailer or manufacturer has to decide at what 
point to reorder or produce a new batch of products and, subsequently, how many items of
the product the batch should comprise. Typically, such reordering decisions involve two 
different kinds of costs: a fixed amount, independent of the size of the order (e.g. cost of 
sending a vehicle from the warehouse to the retailer), and a variable amount proportional 
to the number of products ordered. In the face of uncertainty about the actual demand, this 
decision will generally lead to over- or underproduction, with resultant excess inventories 
incurring unnecessary holding costs (typically accruing at a constant rate per unit of product 
by unit of time), or inability to meet consumer needs, respectively. 


The literature shows two ways of coping with unmet consumer demands: either the lost 
sales case where demand that cannot be met immediately is lost forever, or the backlogging 
case, where demand for the product in excess of the amount stocked will be backlogged. This 
means that these customers will return next period for the product, in addition to the usual 
(random) number of customers who generate demand then. The inability to meet consumer 
needs when they occur results in potentially long term loss of customers for which artificial 
penalty costs called backlogging costs will be charged. The decision maker has to determine 
an optimal inventory policy to minimize the expected cost of ordering and holding inventory. 
In some situations, especially the one of interest in our work, the price at which the product 
is sold to the customer is also a decision variable. In this case demand is not only random 
but is also affected by the selling price. The retailer's or manufacturer's objective is thus 
to find an inventory and pricing strategy maximizing expected profits over the length of the 
planning horizon. 


In this work we mainly focus on the retailing environment, where inventory decisions 
represent ordering decisions. However, the same argumentation can be extended to the
manufacturing setting, where inventory decisions become procurement decisions under a
different cost structure.


2.2. One-period models 


2.2.1. Inventory control 


The one-period model, mostly called the newsvendor model in the literature, is the basis for 
most discrete time stochastic inventory models. It applies when the product's useful life is 
only one planning period and the product becomes obsolete at the end of this single period.
This would be the case when a product perishes or spoils quickly, such as fresh produce (e.g.
eggs, refrigerated orange juice, dairy items), certain short-lived style goods such as many
fashion and seasonal goods, or newspapers (hence the name 'newsvendor' model).
These models are more interesting for their structural importance than their applicability. 
However, dynamic, multi-period models depend critically on them. 


An unknown quantity $D$ of a single product will be demanded during a single period.
While the product-specific distribution of demand is known - we denote its cumulative
distribution function by $\Phi(\cdot)$ - the actual number of units demanded will not be known until
after the decision $y$, which denotes the order-up-to level, has been made. Suppose the initial
inventory on hand at the start of the period is $x > 0$; then the order quantity is given by $y - x$.
Under most commonly used demand distributions, it is impossible to always be assured of
meeting all demand, so the prospect of unmet demand must be accepted. For the newsvendor
model, unsatisfied demand is typically assumed to be lost, for which shortage costs $s \geq 0$
are charged per unit. The tradeoff in such a model is between ordering too many and too few.
If we order too many, then we have some leftover $y - D$ if $y > D$, and we paid for more than we
need. If we order too few, then we could have sold more if we had bought more. Additional
information available to the decision maker includes the proportional (variable) production/
ordering costs $c$, incurred for each item ordered, the selling price $p$ and the per unit salvage
value $v$ (or salvage cost if $v < 0$, respectively), incurred for each unit purchased but not
sold. Clearly these variables should satisfy

\[
p > c > v, \tag{2.2.1}
\]

otherwise the problem can be solved trivially. A brief summary of the introduced notation is
given in table 2.1.


Since demand is a random variable, and we consider a risk-neutral decision maker, the
decision of how many units to order is based on expected profits, which need to be maximized
when solving for optimality. The stochastic profit $\Pi(y, D)$ can be described as

\[
\Pi(y, D) =
\begin{cases}
pD + v(y - D) - c(y - x), & D \leq y, \\
py - s(D - y) - c(y - x), & D > y,
\end{cases}
\tag{2.2.2}
\]

which, with $u$ denoting each actual demand realization and $\phi(\cdot)$ the density of demand,
yields the expected profit

\[
E[\Pi(y, D)] = \int_{-\infty}^{y} [pu + v(y - u)]\,\phi(u)\,du + \int_{y}^{\infty} [py - s(u - y)]\,\phi(u)\,du - c(y - x).
\tag{2.2.3}
\]
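
As a minimal sketch of the profit function (2.2.2) and its expectation, the following Python snippet evaluates both for a discrete demand distribution. The discrete distribution, the function names and the parameter values are assumptions made for illustration only; the text itself works with a continuous demand density.

```python
def newsvendor_profit(y, demand, x, p, c, v, s):
    """Realized one-period profit for order-up-to level y, cf. equation (2.2.2)."""
    ordering_cost = c * (y - x)
    if demand <= y:
        # All demand is served; the y - demand leftover units are salvaged at v.
        return p * demand + v * (y - demand) - ordering_cost
    # Stock-out: only y units are sold and each unit of lost demand costs s.
    return p * y - s * (demand - y) - ordering_cost


def expected_profit(y, demand_pmf, x, p, c, v, s):
    """Expected profit when demand follows a discrete pmf {demand: probability}."""
    return sum(prob * newsvendor_profit(y, d, x, p, c, v, s)
               for d, prob in demand_pmf.items())


if __name__ == "__main__":
    # Hypothetical data: demand is 0, 10 or 20 units; p=10, c=4, v=1, s=2, x=0.
    pmf = {0: 0.25, 10: 0.5, 20: 0.25}
    for y in (0, 10, 20):
        print(y, expected_profit(y, pmf, x=0, p=10, c=4, v=1, s=2))
```

With these hypothetical numbers, ordering up to 10 units gives an expected profit of 32.5, compared with 30 for ordering up to 20 units, which illustrates the tradeoff between ordering too many and too few.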


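By adding and subtracting the quantity $p \int_{y}^{\infty} u\,\phi(u)\,du$ to $E[\Pi(y, D)]$, so that the
revenue terms combine to $pE[D]$, we get

\[
E[\Pi(y, D)] = pE[D] + (p + s) \int_{y}^{\infty} (y - u)\,\phi(u)\,du + v \int_{-\infty}^{y} (y - u)\,\phi(u)\,du - c(y - x),
\tag{2.2.4}
\]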


where $E[D]$ denotes expected demand. We now differentiate $E[\Pi(y, D)]$ with respect to $y$
(by applying the Leibniz integral rule) and set this equal to zero:

\[
\frac{d}{dy} E[\Pi(y, D)] = (p + s)[1 - \Phi(y)] + v\Phi(y) - c = 0,
\tag{2.2.5}
\]

which leads to the optimal inventory level after ordering $y^*$:

\[
y^* = \Phi^{-1}\left(\frac{p + s - c}{p + s - v}\right)
\tag{2.2.6}
\]

if $x < y^*$. Otherwise it is optimal not to order. Such a policy yields a global maximum for
the expected profit, since the selling price $p$ being greater than the salvage value $v$ and the
unit shortage costs $s$ being nonnegative result in the expected profit being strictly concave:

\[
\frac{d^2}{dy^2} E[\Pi(y, D)] = -(p + s - v)\,\phi(y) < 0.
\]

$D \geq 0$ | quantity demanded in the period (random variable)
$\epsilon = D - E[D]$, $E[\epsilon] = 0$ | perturbation of demand (random variable) under additive demand uncertainty, with mean 0
$F(u) = \Phi(E[D] + u)$ | cdf of demand perturbation
$f(u)$, with $F(y) = \int_{-\infty}^{y} f(u)\,du$ | probability density function of demand perturbation
$b > (1 - \gamma)c$ | per unit backlog penalty costs
$h$ | per unit holding costs

Table 2.1.: Notation: Newsvendor models
Figure 2.1.: Base-stock policy (horizontal axis: inventory before ordering $x$; vertical axis: optimal inventory after ordering $y^*$)


From this concavity property it becomes clear that $y^*$ is the unique optimum when $x < y^*$.
Moreover, concavity in $y$ ensures that it is optimal not to place an order ($y = x$) if $x > y^*$,
because the expected profit is strictly decreasing for any $y > x > y^*$. In the literature, such
a policy is often called a base-stock policy.


Definition 2.1 (Base-stock policy). A base-stock (order-up-to) policy is characterized by an
order-up-to level, often referred to as the base-stock level $S^*$. If the initial inventory level
before ordering $x$ is below the base-stock level, an order is placed to raise the inventory
level up to the base-stock level. Otherwise, no order is placed (figure 2.1 gives a graphical
description of such a policy):

\[
y^*(x) =
\begin{cases}
S^*, & x < S^*, \\
x, & x \geq S^*.
\end{cases}
\tag{2.2.7}
\]
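
The order-up-to rule of Definition 2.1 can be sketched in a few lines of Python. The function names are hypothetical and chosen only for this illustration.

```python
def order_up_to(x, base_stock):
    """Inventory level after ordering under a base-stock policy, cf. (2.2.7)."""
    # Below the base-stock level: raise inventory to S*. Otherwise: do nothing.
    return base_stock if x < base_stock else x


def order_quantity(x, base_stock):
    """Quantity ordered, y*(x) - x; zero when x is already at or above S*."""
    return order_up_to(x, base_stock) - x
```

For example, with a base-stock level of 10, an initial inventory of 3 triggers an order of 7 units, whereas an initial inventory of 12 triggers no order.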


For later investigations, some slightly different notation will be convenient, which we thus
introduce here: When uncertainty in demand is modeled additively as

\[
D(\epsilon) = E[D(\epsilon)] + \epsilon,
\tag{2.2.8}
\]

where $\epsilon$ denotes the random perturbation of demand with mean zero and a cumulative
distribution function $F(\cdot)$, then it becomes clear that $\Phi(D) = F(D(\epsilon) - E[D(\epsilon)])$. The
following theorem thus follows directly from equation (2.2.6).


Theorem 2.1. Let demand $D(\epsilon)$ be modeled additively, such that $D(\epsilon) = E[D(\epsilon)] + \epsilon$, with
$E[D(\epsilon)]$ being the mean demand and $\epsilon$ a random variable with mean zero and following
a cumulative distribution function $F(\cdot)$. Then for the lost sales case an order-up-to policy
is optimal, and the optimal base-stock level $S^*$ is given by

\[
S^* = F^{-1}\left(\frac{p + s - c}{p + s - v}\right) + E[D(\epsilon)],
\tag{2.2.9}
\]

where $p$ denotes the unit selling price, $c$ the unit ordering costs with $0 < c < p$, $s \geq 0$ the
unit shortage costs, and $v < c$ the unit salvage value.


Remark 2.1. $C_u = (p + s - c)$ represents the opportunity cost of underestimating demand
and $C_o = (c - v)$ the cost of overestimating demand. The ratio $C_u/(C_u + C_o)$ in
equation (2.2.9) is known as the critical fractile. Intuitively, it corresponds to the safety factor
at which the expected profit lost from being one unit short is equal to that from being one
unit over.
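
To make the critical fractile concrete, the sketch below evaluates the base-stock level of equation (2.2.9) under the additional assumption that the perturbation $\epsilon$ is normally distributed with mean zero and standard deviation `sigma`. The normality assumption, the function name and the parameter values are illustrative choices, not part of the theorem.

```python
from statistics import NormalDist


def lost_sales_base_stock(p, c, v, s, mean_demand, sigma):
    """Base-stock level S* from (2.2.9), assuming epsilon ~ N(0, sigma^2)."""
    if not (0 < c < p and v < c and s >= 0):
        raise ValueError("expected 0 < c < p, v < c and s >= 0")
    underage = p + s - c   # C_u: opportunity cost of underestimating demand
    overage = c - v        # C_o: cost of overestimating demand
    critical_fractile = underage / (underage + overage)
    # F^{-1} is the quantile function of the zero-mean perturbation epsilon.
    safety_stock = NormalDist(mu=0.0, sigma=sigma).inv_cdf(critical_fractile)
    return safety_stock + mean_demand
```

When $C_u = C_o$ the critical fractile equals 1/2 and the base-stock level coincides with mean demand; a larger underage cost pushes $S^*$ above the mean, a larger overage cost pushes it below.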


Logistics employs a different model if demand in excess of the amount stocked is backlogged.
In this case, customers will return after the end of the period, where there is one more
chance to place an order for the outstanding items, which are then instantaneously delivered
to the customers (see e.g. Porteus (1990) for newsvendor models with partial backlogging or
Khouja (1996) for newsvendor models with an emergency supply option). But, at the same
time, backlogging costs $b > 0$ are charged as penalty costs for the inability to meet consumer
needs when they occur. In case of overproduction with resultant excess inventories
at the end of the period, holding costs $h > 0$ occur. These could be interpreted as carrying
charges until the remaining items can be sold, e.g. to a discount store for some salvage value
$v$. Holding and backlogging costs are charged for the period when they occur, whereas any
financial flow after the end of the period (reordering/salvaging opportunity) is discounted by
a discount factor $0 < \gamma < 1$. A brief summary of the newly introduced variables is given in
table 2.1. To ensure that it is not optimal to order nothing at all and merely accumulate
backlog penalty costs, $b > (1 - \gamma)c$ is also assumed. Note that since holding costs and
salvage value always occur together ($h - \gamma v$) and backlogging costs are always associated
with ordering costs after the end of the period ($b + \gamma c$), each pair could be integrated into
one variable, which is usually the case in literature. However, we here choose to keep
them separate in preparation for the multi-period inventory model. Maximizing expected profits
yields the following theorem.


Theorem 2.2. Let demand $D(\epsilon)$ again be modeled additively, such that $D(\epsilon) = E[D(\epsilon)] + \epsilon$,
with $E[D(\epsilon)]$ being the mean demand and $\epsilon$ a random variable with mean zero and
following a cumulative distribution function $F(\cdot)$. Then for the backlogging case an order-up-to
policy is also optimal, with the optimal base-stock level $S^*$ given by

\[
S^* = F^{-1}\left(\frac{b - (1 - \gamma)c}{h + b - \gamma(v - c)}\right) + E[D(\epsilon)],
\tag{2.2.10}
\]

where $p$ denotes the unit selling price, $c$ the unit ordering costs with $0 < c < p$, $\gamma$ the
discount factor with $0 < \gamma < 1$, $b > (1 - \gamma)c$ the unit backlogging costs, $h > 0$ the unit
holding costs and $v < c$ the unit salvage value.
Proof. Let $u$ denote the realization of the random variable $\epsilon$ and differentiate the expected
one-period profit

\[
\begin{aligned}
E[\Pi(y, \epsilon)] = {} & pE[D(\epsilon)] - c(y - x) - (h - \gamma v) \int_{-\infty}^{y - E[D(\epsilon)]} (y - E[D(\epsilon)] - u) f(u)\,du \\
& - (b + \gamma c) \int_{y - E[D(\epsilon)]}^{\infty} (E[D(\epsilon)] + u - y) f(u)\,du
\end{aligned}
\tag{2.2.11}
\]

with respect to the inventory $y$. This leads to

\[
\frac{\partial E[\Pi(y, \epsilon)]}{\partial y} = -c - (h - \gamma v) F(y - E[D(\epsilon)]) - (b + \gamma c) F(y - E[D(\epsilon)]) + (b + \gamma c).
\tag{2.2.12}
\]

Setting this equal to zero and solving for $y$ gives the desired result, since with $c > v$, $\gamma > 0$
and $h, b > 0$, it follows that the expected profit $E[\Pi(y, \epsilon)]$ again is strictly concave in $y$
($\partial^2 E[\Pi(y, \epsilon)]/\partial y^2 = -(h + b + \gamma c - \gamma v) f(y - E[D(\epsilon)]) < 0$). □


Remark 2.2. Note that the above result can only be obtained for backlogging costs b being 
independent of price p. 


Remark 2.3. The critical fractile $(b - (1 - \gamma)c)/(h + b - \gamma(v - c))$ has a similar interpretation
as in equation (2.2.9): it corresponds to the order quantity at which the expected profit lost from
being one unit short is equal to that from being one unit over. Here $C_u = (b - (1 - \gamma)c)$
denotes the opportunity cost of underestimating demand and $C_o = (h + c - \gamma v)$ the cost
of overestimating demand. The above ratio is again given by $C_u/(C_u + C_o)$. Note that in
the backlogging case $p$ does not appear in the critical fractile. This is because the items are
sold in any case. Today, the situation characterized by formula (2.2.10) is more prevalent in
practice, as due to competition, firms are more willing to incur substantial backlogging costs
than to lose customers in the future due to unsatisfied demand.
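
The backlogging formula (2.2.10) can be evaluated in the same way as the lost sales case. The sketch below again assumes, purely for illustration, a normally distributed perturbation $\epsilon$ with standard deviation `sigma`; the function name and the numerical values are hypothetical. Note that the selling price is not an argument, in line with the observation that $p$ does not appear in the critical fractile.

```python
from statistics import NormalDist


def backlog_base_stock(c, v, b, h, gamma, mean_demand, sigma):
    """Base-stock level S* from (2.2.10), assuming epsilon ~ N(0, sigma^2)."""
    if not (0 < gamma < 1 and v < c and h > 0 and b > (1 - gamma) * c):
        raise ValueError("parameters violate the assumptions of Theorem 2.2")
    underage = b - (1 - gamma) * c   # C_u: cost of being one unit short
    overage = h + c - gamma * v      # C_o: cost of being one unit over
    critical_fractile = underage / (underage + overage)
    return NormalDist(0.0, sigma).inv_cdf(critical_fractile) + mean_demand
```

With $c = 4$, $v = 2$, $b = 6$, $h = 1$ and $\gamma = 0.5$, both $C_u$ and $C_o$ equal 4, so the critical fractile is 1/2 and the base-stock level equals mean demand.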


2.2.2. Joint pricing and inventory control 


We now apply the newsvendor problem to analyze firms that jointly set a selling price and
a stocking quantity prior to facing random demand in a single period. Such an extended
model incorporates the price as a decision variable, which provides an excellent vehicle for
examining how operational problems interact with marketing issues to influence decision
making at the corporate level. One of the first attempts to address joint marketing-production
decision making was presented by Whitin (1955), who formulated a newsvendor model
with price effects. He adapted the model described in section 2.2.1 in such a way that the
probability distribution of demand depends on the selling price, where price is a decision
variable rather than an external parameter, and found that a gain can be achieved by
coordinating marketing and logistics more closely. A good survey of price-setting newsvendor
models can be found in Petruzzi and Dada (1999). We now redefine the additive demand
from equation (2.2.8) in such a way that the mean demand is a function of the selling
price:

$$ D(p,\epsilon) = E[D(p,\epsilon)] + \epsilon, \qquad (2.2.13) $$
whereby in the following expected demand is assumed to be a non-negative linear function
$E[D(p,\epsilon)] = \beta_0 + \beta_1 p$, with $\beta_0 > 0$ and $\beta_1 < 0$. Since in contrast to equation (2.2.3)
$u$ here describes the realization of the random variable $\epsilon$, each actual demand realization is
now given by $u + E[D(p,\epsilon)]$. By furthermore defining the safety stock $z = y - E[D(p,\epsilon)]$,
the goal function from equation (2.2.3) becomes

$$ E[\Pi(z,p,\epsilon)] = \int_{-\infty}^{z} \big(p(E[D(p,\epsilon)] + u) + v(z - u)\big) f(u)\,du + \int_{z}^{\infty} \big(p(E[D(p,\epsilon)] + z) - s(u - z)\big) f(u)\,du - c(E[D(p,\epsilon)] + z - x). \qquad (2.2.14) $$


Petruzzi and Dada (1999) show by simple differentiation (as we did in section 2.2.1) that
$E[\Pi(z,p,\epsilon)]$ is concave in $p$ for a given $z$, which guarantees that the sequential process
of first optimizing $p$ for a given $z$ and then searching over the resulting optimal trajectory
to maximize $E[\Pi(z,p(z),\epsilon)]$ in the safety stock $z$ yields the optimal solution. In this case
the optimal price for the integrated problem is given by (see Petruzzi and Dada (1999) for
details)

$$ p^*(z) = p^0 + \frac{\Theta(z)}{2\beta_1}, \qquad (2.2.15) $$
where $p^0$ denotes the optimal riskless price
$$ p^0 = -\frac{\beta_0 - \beta_1 c}{2\beta_1}, \qquad (2.2.16) $$

which is obtained by differentiating the marketing goal function $(p - c)E[D(p,\epsilon)]$ with
respect to $p$ and setting the result equal to zero (Phillips (2005), Chapter 1). Furthermore,
$\Theta(z) = \int_{z}^{\infty} (u - z) f(u)\,du$ denotes the expected lost sales when a safety stock $z$ is chosen.
Since $\beta_1 < 0$ and $\Theta(z)$ is nonnegative, the following theorem holds:


Theorem 2.3. In the lost sales case the optimal riskless price $p^0$ is higher than or equal to
the optimal price $p^*$ incorporating risk:
$$ p^0 \geq p^*. $$
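
The relation between the riskless price (2.2.16) and the integrated price (2.2.15) can be illustrated numerically. The sketch below assumes a normally distributed perturbation with mean 0, for which the expected lost sales have a closed form; the function names and the value of sigma are assumptions made for illustration only.

```python
import math


def expected_lost_sales_normal(z, sigma):
    """Theta(z) = integral_z^inf (u - z) f(u) du for a N(0, sigma^2) perturbation."""
    k = z / sigma
    pdf = math.exp(-0.5 * k * k) / math.sqrt(2.0 * math.pi)  # standard normal density at k
    tail = 0.5 * math.erfc(k / math.sqrt(2.0))               # 1 - Phi(k)
    return sigma * (pdf - k * tail)


def riskless_price(beta0, beta1, c):
    """Riskless price p^0 from (2.2.16) for E[D] = beta0 + beta1 * p with beta1 < 0."""
    return -(beta0 - beta1 * c) / (2.0 * beta1)


def integrated_price(z, beta0, beta1, c, sigma):
    """Integrated lost-sales price p*(z) = p^0 + Theta(z) / (2 * beta1) from (2.2.15)."""
    theta = expected_lost_sales_normal(z, sigma)
    return riskless_price(beta0, beta1, c) + theta / (2.0 * beta1)


if __name__ == "__main__":
    # Demand parameters as in the numerical example of section 2.3.3; sigma is assumed.
    for z in (-10.0, 0.0, 10.0):
        print(z, integrated_price(z, beta0=100.0, beta1=-20.0, c=0.5, sigma=5.0))
```

Because $\beta_1 < 0$ and the expected lost sales are nonnegative, every computed price lies at or below the riskless price, and the gap shrinks as the safety stock grows.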


Remark 2.4. In the integrated setting the price is used to reduce the coefficient of variation
of demand, and the difference between the integrated optimal price and the optimal price set
by marketing in isolation is decreasing with increased price sensitivity (slope $\beta_1$ of the
demand function) and demand uncertainty. However, as Petruzzi and Dada (1999) show, this
effect is reversed if randomness is modeled in a multiplicative way as
$D(p,\epsilon) = E[D(p,\epsilon)]\epsilon$.

Let us now look at the backlogging case, which has so far not attracted interest in the literature.
By again defining $z = y - E[D(p,\epsilon)]$ as the safety stock, the goal function from equation
(2.2.11) becomes


$$ E[\Pi(z,p,\epsilon)] = pE[D(p,\epsilon)] - c(E[D(p,\epsilon)] + z - x) - (h - \gamma v)\int_{-\infty}^{z} (z - u) f(u)\,du - (b + \gamma c)\int_{z}^{\infty} (u - z) f(u)\,du. \qquad (2.2.17) $$
It is easy to see that the price no longer influences expected holding or backlogging costs.
This differs from the lost sales case, where we lose sales if demand exceeds stock and hence
expected revenues are reduced by $p\int_{z}^{\infty} (u - z) f(u)\,du$ (compare equation (2.2.4)).
Differentiating the expected profit (2.2.11) with respect to price yields

$$ \frac{\partial E[\Pi(z,p,\epsilon)]}{\partial p} = E[D(p,\epsilon)] + (p - c)E[D_p(p,\epsilon)], \qquad (2.2.18) $$
where $E[D_p(p,\epsilon)] = \partial E[D(p,\epsilon)]/\partial p$. Setting (2.2.18) equal to zero and solving for $p$ leads to the
following theorem.


Theorem 2.4. In the backlogging case, the optimal price $p^*$ for the integrated problem is
the same as the optimal price $p^0$ obtained by the sequential method:

$$ p^* = p^0 = -\frac{\beta_0 - \beta_1 c}{2\beta_1}. $$


Remark 2.5. In the backlogging case, we can set the price independently of the inventory 
decision. Thus, while the sequential approach is not optimal in the lost-sales case, no gain 
is achieved by joint optimization in the backlogging case, which is, as already stated above, 
more prevalent in practice. 


2.3. Multi-period models 


We are now ready to consider the finite horizon multi-period version of the problem setting
described in the last section, which was first introduced and solved by Arrow et al. (1951).
The backlogging version of the system described in section 2.2 will now be operated over
$T$ periods. What makes the problem more complicated than solving $T$ copies of the
single-period problem is that any leftover stock at the end of one period is retained and can be
offered for sale in the following period (see figure 2.2 for a sample inventory path). The
inventory level $x_t$ is reviewed at regular intervals (e.g. each week or month), an appropriate
quantity $y_t - x_t$ is ordered and a per unit selling price $p_t$ is charged after each review at the
beginning of a new period $t$. For easier tractability and clarity of the formulas we assume
that all input variables are stationary and thus not anticipated to change over time (most of
the results presented in the following also hold in the time-variant case). Each unit of positive
leftover stock at the end of each period incurs holding costs $h$. If demand exceeds the
inventory on hand, the additional demand is backlogged and filled when additional
inventory becomes available; the backlogged units are viewed as negative inventory.
This means that these customers will return in the next period for the product, in addition to the
usual random number of customers. A per unit backlogging cost $b$, with $b > (1-\gamma)c$, is charged
as a penalty cost. The newly arising demands $D_t$ in different periods are assumed to be
statistically independent and identically distributed according to general stochastic demand
functions as in the above section. The ordering cost function includes both a per unit variable
cost $c$ and a fixed setup cost $k$, which is incurred if an order is placed ($y_t > x_t$), regardless
of its size. If no order is placed, no setup costs are incurred. Orders placed are essentially
received immediately (received in time to meet demand that arises in that period). All costs
are expressed in beginning-of-period cash units; cash flows occurring in subsequent time
periods are discounted by a one-period discount factor $\gamma \in (0,1]$. After the last period,
the remaining inventory is salvaged at a per unit salvage value $v$ or backlogged demand is
satisfied and thus a final order is placed. For an overview of the variables introduced in this
section we refer the reader to table 2.2 at the end of this section. The objective of the
dynamic version of the backlogging inventory model is to maximize total expected discounted
profits $V(x_1)$, when the initial inventory on stock before ordering at the beginning of the
planning horizon is $x_1$:


$$ V(x_1) = \sum_{t=1}^{T} \gamma^{t-1} \max_{y_t \geq x_t, (p_t)} \big( p_t E[D_t] - c(y_t - x_t) - k\delta(y_t - x_t) - G(y_t, p_t) \big) + \gamma^{T} L(x_{T+1}), \qquad (2.3.1) $$
with $\delta(u) = 1$ if $u > 0$ and $\delta(u) = 0$ otherwise. Moreover, $G(y_t, p_t)$ denotes the
expected inventory holding/backlogging costs in period $t$ and $L(x_{T+1})$ the salvage value/
reordering costs at the end of the planning horizon:

$$ G(y_t, p_t) = E[h \max(y_t - D_t, 0) + b \max(D_t - y_t, 0)], \qquad (2.3.2) $$
$$ L(x_{T+1}) = v \max(x_{T+1}, 0) + c \min(x_{T+1}, 0). \qquad (2.3.3) $$


If the demand distribution functions are discretized, according to Jung et al. (2004) the
evolution of demands over time can be represented by a tree-like structure (see figure 2.3).
Starting from each node, there can be several possible demand realizations, expressed as
branches stemming from that node. Assuming $m$ possible next-period demand realizations
at each node, the total number of scenarios will amount to $m^T$. At each period $t$ each node
is associated with the realization of demand, the decision variables and the state variables.
Complete enumeration would amount to an exponential complexity of $O(m^T)$, where $O(\cdot)$
denotes the Big O notation, which describes the runtime complexity of an algorithm.
Therefore a stochastic dynamic programming approach with the significantly lower complexity of
$O(Tm)$ is described in the following to model the planning process as it reacts to demand
realizations unfolding over time.

Figure 2.2.: Inventory path over time


2.3.1. Dynamic programming formulation


Dynamic programs deal with situations where decisions are made in stages. In a stochastic 
setting dealing with random parameters, the outcome of each decision is not fully predictable 
but can be anticipated to some extent before the next decision is made. The objective in 
dynamic programs is either to minimize or to maximize a certain value which can be expected 
costs or profits, respectively. A key aspect of such environments is that decisions cannot be 
viewed in isolation since one must balance the desire for high (respectively low) present 
values against the undesirability of low (respectively high) future values. The technique of 
dynamic programming captures this tradeoff. At each stage it ranks decisions based on the 
sum of the present values and the expected future values, assuming optimal decision making 
for subsequent stages. The states of the system summarize past information that is relevant 
for future optimization. 


The principle of dynamic programming was popularized by Richard Bellman in the 1950s
and consists of decomposing such a complicated problem into a sequence of equivalent
single-period problems. One need only specify the optimal value of starting the next period (as a
function of the starting state) and continuing optimally over the remainder of the planning
horizon as the 'salvage' value function. In the case of dynamic models, this usually amounts to
working backwards. A good review of how stochastic dynamic programming models, also referred
to as Markov decision processes or stochastic control problems, are applied in the economic
literature can be found in e.g. Stokey et al. (1989), Puterman (1994), Porteus (2002), Miranda
and Fackler (2002), Heyman and Sobel (2004) and Bertsekas (2005).


Figure 2.3.: Evolution of sample paths (axis label: time period)

Dynamic programming using backward recursion is an appropriate technique for solving
the above multi-period maximization problem. Thus, equation (2.3.1) reformulated in
terms of dynamic programming becomes:

$$ V_t(x_t) = \max_{y_t \geq x_t, (p_t)} \{ J_t(x_t, y_t, p_t) \}, \qquad (2.3.4) $$

$$ J_t(x_t, y_t, p_t) = p_t E[D_t] - c(y_t - x_t) - k\delta(y_t - x_t) - G(y_t, p_t) + \gamma E[V_{t+1}(y_t - D_t)], \qquad (2.3.5) $$


where the value function $V_t(x)$ denotes the maximum expected discounted profit for periods
$t, \ldots, T$ (profit-to-go function) when starting period $t$ with initial inventory level $x$, and
$V_{T+1}(x_{T+1}) = L(x_{T+1}) = v \max(x_{T+1}, 0) + c \min(x_{T+1}, 0)$. Note that again $G(y_t, p_t)$ denotes
the expected holding/backlogging costs $G(y_t, p_t) = E[h \max(y_t - D_t, 0) + b \max(D_t - y_t, 0)]$.
Equation (2.3.4) describes the system in the state $x_t$, the inventory level before
ordering, and the action(s) $y_t$, the inventory level after ordering (and the price $p_t$). Note
that the admissible action space is restricted to $[x_t, \infty)$, since only nonnegative orders are
permitted. State and action (decision) variables are related via the following transition
function:

$$ x_{t+1} = y_t - D_t. \qquad (2.3.6) $$

Equation (2.3.6) gives the gross quantity of stock on hand at the beginning of period $t + 1$, which
equals the inventory on hand after ordering at the beginning of the previous time period less
the total quantity actually sold during that period (see figure 2.2). A brief idea of the system
dynamics is given in figure 2.4.
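
The recursion (2.3.4)-(2.3.6) can be sketched in a few lines of code. The following is a minimal illustration under simplifying assumptions that are not part of the model above: inventory and demand are integers, demand has a finite probability mass function, the price p is fixed rather than optimized, and the search over y is capped at a hypothetical bound y_max. All function and parameter names are chosen for illustration only.

```python
from functools import lru_cache


def solve_inventory_dp(T, demand_pmf, p, c, h, b, v, gamma, k=0.0, y_max=20):
    """Backward recursion for (2.3.4)-(2.3.5) with integer stock and a fixed price p.

    demand_pmf maps integer demand values to probabilities (assumed finite support).
    Returns a function value(t, x) -> (V_t(x), optimal inventory after ordering).
    """
    mean_demand = sum(d * q for d, q in demand_pmf.items())

    def holding_backlog_cost(y):
        # G(y) = E[h max(y - D, 0) + b max(D - y, 0)], see (2.3.2)
        return sum(q * (h * max(y - d, 0) + b * max(d - y, 0))
                   for d, q in demand_pmf.items())

    @lru_cache(maxsize=None)
    def value(t, x):
        if t == T + 1:
            # Terminal value L(x) = v max(x, 0) + c min(x, 0), see (2.3.3)
            return v * max(x, 0) + c * min(x, 0), x
        best_val, best_y = None, None
        for y in range(x, max(x, y_max) + 1):        # only nonnegative orders
            setup = k if y > x else 0.0              # k * delta(y - x)
            future = sum(q * value(t + 1, y - d)[0]  # transition x' = y - D, see (2.3.6)
                         for d, q in demand_pmf.items())
            j = (p * mean_demand - c * (y - x) - setup
                 - holding_backlog_cost(y) + gamma * future)
            if best_val is None or j > best_val + 1e-12:
                best_val, best_y = j, y              # ties resolved toward smaller y
        return best_val, best_y

    return value


if __name__ == "__main__":
    # Illustrative data (assumed): three periods, demand in {0, 1, 2}, v = c.
    value = solve_inventory_dp(T=3, demand_pmf={0: 0.25, 1: 0.5, 2: 0.25},
                               p=3.0, c=1.0, h=0.2, b=2.0, v=1.0, gamma=1.0)
    for x in range(-2, 5):
        print(x, value(1, x))
```

Memoizing on the pair (t, x) is what keeps the work proportional to the number of periods times the number of reachable states, instead of enumerating every demand path. In this small instance the optimal inventory after ordering has the form max(x, S) with the same level S in every period.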


Figure 2.4.: System dynamics

In the study of stochastic dynamic programming models, researchers often attempt to
establish certain structural properties of the value function in the state variables, such as
monotonicity, convexity or supermodularity. Properties such as convexity can be
enough to specify the general form of the optimal policy. Establishing the existence of
optimal policies with a special structure is of great practical importance, since such policies
are highly appealing to decision makers, are easy to implement and enable efficient computation.
In such cases specialized algorithms can be developed to search only among policies that have
the same form as the optimal policy, which reduces computation time significantly. When
the optimality of e.g. monotone decision rules is known, efficient backward induction
algorithms (see Puterman (1994), section 4.7.6) can be developed by successively restricting the
action space. Furthermore, such properties can help in developing a qualitative understanding
of the model by describing how the results will change with changes in the model
parameters. For a good review and general results on structural properties of stochastic dynamic
programs we refer the reader to e.g. Smith and McCardle (2002), Puterman (1994), Topkis
(1998), Bertsekas (2005), Bertsekas (2001) and Heyman and Sobel (2004).


The following two subsections are devoted to a review of some simple forms
of optimal policies that have already been found in the literature and also provide some intuitive
understanding of the structural results, which will be useful for a better understanding of
the integrated model in chapters 5 and 6. Furthermore, we include a brief convergence
analysis.


2.3.2. Inventory control 


Bellman and Glicksberg (1955) were the first to show that the optimal total cost function is
convex in inventory on stock before ordering for certain stationary assumptions, which means
that a constant stock level is optimal (often referred to as base-stock level, see definition 2.1
on page 21). Wagner and Whitin (1958) presented a forward algorithm for the solution
of the dynamic version of the economic lot size model. In the following we give the reader
an idea of how structural properties are maintained by induction from one time period to the
next and so lead to a base-stock policy. Since in this section we focus on solely optimizing
the inventory level $y_t$ in each time period, we let demand be exogenously given by
$D_t = E[D(\epsilon)] + \epsilon_t$ (see section 2.2.1). Furthermore, in this thesis we consider only models where
fixed ordering costs are not included ($k = 0$). We thus reformulate the dynamic program
(2.3.4) in the following way:


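$$ V_t(x) = \max_{y \geq x} \{ J_t(x, y) \}, \qquad (2.3.7) $$
$$ J_t(x, y) = pE[D(\epsilon)] - c(y - x) - G(y) + \gamma E[V_{t+1}(y - D(\epsilon))] \qquad (2.3.8) $$
with
$$ G(y) = h \int_{-\infty}^{y - E[D(\epsilon)]} (y - E[D(\epsilon)] - u) f(u)\,du + b \int_{y - E[D(\epsilon)]}^{\infty} (E[D(\epsilon)] + u - y) f(u)\,du \qquad (2.3.9) $$

and $V_{T+1}(x) = v \max(x, 0) + c \min(x, 0)$.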


To prove that a base-stock policy is optimal, we establish the following lemma, which can 
be found on page 525 in Heyman and Sobel (2004): 


Lemma 2.1 (Concavity preservation under maximization). If $X$ is a convex set, $Y(x)$ a
nonempty set for every $x \in X$, $C := \{(x, y) \mid x \in X, y \in Y(x)\}$ a convex set and $g(x, y)$ a
(jointly) concave function on $C$,

$$ f(x) = \max_{y \in Y(x)} g(x, y) $$

and $g(x, y) < \infty$ for every $x \in X$, then $f$ is a concave function on $X$.


Theorem 2.5. Let a multi-period inventory control model be given by the dynamic program
defined in equation (2.3.7), whereby $v = c$ is assumed and thus $V_{T+1}(x) = cx$. Then the
following holds for any time period $t = 1, \ldots, T$:


1. $J_t(x, y)$ is jointly concave in $x$ and $y$.
2. $V_t(x)$ is concave in $x$.
3. A base-stock policy with order-up-to level $S_t^*$ is optimal in time period $t$.


Proof. The proof follows the principle of induction. By applying Leibniz's integration rule
we find $-G(y)$ to be concave in $y$:

$$ \frac{d(-G(y))}{dy} = b - (h + b)F(y - E[D(\epsilon)]), \qquad (2.3.10) $$
$$ \frac{d^2(-G(y))}{dy^2} = -(h + b)f(y - E[D(\epsilon)]) \leq 0. \qquad (2.3.11) $$

Since $-G(y)$ does not depend on $x$, $-G(y)$ is also jointly concave in $x$
and $y$. Moreover, $V_{T+1}(x)$ and the first term in equation (2.3.8) are trivially jointly concave.
The second term $-c(y - x)$ is linear in both variables $x$ and $y$ and thus jointly concave,
too. Since any positive linear combination of jointly concave functions is again jointly
concave, it is now clear that $J_T(x, y)$ is jointly concave. Since $\{(x, y) \mid y \geq x\}$ is
a convex set, lemma 2.1 can be applied and therefore $V_T(x)$ is concave in $x$. The function
$J_T(x, y)$ being jointly concave furthermore yields the optimality of a base-stock policy (see
section 2.2.1) for time period $t = T$. Assume now that $V_{t+1}(x)$ is concave in $x$. By the same
argumentation as used above for $t = T$, $J_t(x, y)$ is then shown to be jointly concave in $x$
and $y$, which yields that $V_t(x)$ is concave in $x$. Thus we have shown that a base-stock policy is
optimal for any time period $t$. □

Figure 2.5.: Base-stock path over time


Remark 2.6. A terminal value $V_{T+1}(x) = cx$ means that leftover units at the end of the
planning horizon can be salvaged at the same cost at which they were originally bought. This
assumption is common in the literature since it guarantees easy analytical tractability.


Figure 2.5 shows that for the finite horizon case with no salvage value ($v = 0$), the
optimal base-stock level decreases over time. Towards the end of the planning horizon the
remaining time gets shorter and the risk of not selling the inventory on stock increases; the
decision maker hedges against the resulting costs by a diminishing base-stock level.
Of course there is no such risk in the case where the per unit salvage value equals the per unit
ordering cost ($v = c$). In this case, any inventory on stock that is not sold by the end of
the planning horizon can be salvaged for the same amount of money for which it was ordered.
Here, a myopic policy which looks only at the single period backlogging problem described
in subsection 2.2.1 is optimal in every period, regardless of the time horizon $T$ (see Veinott
(1965)).


Theorem 2.6. If the dynamic program (2.3.7) admits a steady state, then it is given by

$$ S_\infty^* = E[D(\epsilon)] + F^{-1}\left(\frac{b - (1-\gamma)c}{h + b}\right). \qquad (2.3.12) $$

Proof. We can rewrite equation (2.3.1) for an infinite time horizon as follows:

$$ V(x_1) = \sum_{t=0}^{\infty} \gamma^{t} \max_{y_{t+1} \geq x_{t+1}} \{ pE[D(\epsilon_{t+1})] - c(y_{t+1} - x_{t+1}) - G(y_{t+1}) \}, $$
where $x_1$ denotes the starting inventory level at the beginning of the time horizon. We now
replace $x_{t+1} = y_t - D(\epsilon_t)$ for all $t \geq 1$ and then rearrange the sum in such a way that we
are left only with terms indexed by $t + 1$ in the $t$-th summand:

$$ V(x_1) = cx_1 + \sum_{t=0}^{\infty} \gamma^{t} \max_{y_{t+1} \geq x_{t+1}} \{ pE[D(\epsilon_{t+1})] - c(y_{t+1} - \gamma y_{t+1} + \gamma D(\epsilon_{t+1})) - G(y_{t+1}) \}. \qquad (2.3.13) $$
As we assume that we are in a steady state, we replace the time-dependent $y_t$ by a
time-invariant $y$. Since the profit function $V(x_1)$ is a concave function in $y$, the optimal inventory
level after ordering $y^*$ can now be obtained by differentiating $V(x_1)$ with respect to $y$ and
setting the result equal to zero. Using (2.3.10) we thus obtain

$$ \frac{\partial V(x_1)}{\partial y} = \frac{1}{1-\gamma}\big( -(1-\gamma)c + b - (h + b)F(y - E[D(\epsilon)]) \big) = 0, \qquad (2.3.14) $$
which results in equation (2.3.12). □
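
Equation (2.3.12) is straightforward to evaluate once a distribution for the perturbation is fixed. The sketch below assumes a normal perturbation with mean 0 and standard deviation sigma; the function name is hypothetical, and the unit cost c = 0.5 used in the usage lines is an assumption (it has no effect when the discount factor equals 1).

```python
from statistics import NormalDist


def steady_state_base_stock(mean_demand, sigma, b, h, c, gamma):
    """Steady-state base-stock level from (2.3.12) for a N(0, sigma^2) perturbation."""
    fractile = (b - (1.0 - gamma) * c) / (h + b)
    if not 0.0 < fractile < 1.0:
        raise ValueError("fractile must lie strictly between 0 and 1")
    return mean_demand + NormalDist(0.0, sigma).inv_cdf(fractile)


if __name__ == "__main__":
    base = steady_state_base_stock(45.0, 40.0, b=0.4, h=0.005, c=0.5, gamma=1.0)
    higher_b = steady_state_base_stock(45.0, 40.0, b=0.5, h=0.005, c=0.5, gamma=1.0)
    lower_gamma = steady_state_base_stock(45.0, 40.0, b=0.4, h=0.005, c=0.5, gamma=0.9)
    print(base, higher_b, lower_gamma)
```

Raising the backlogging cost raises the level, and lowering the discount factor lowers it, since both act only through the argument of the inverse distribution function.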


Remark 2.7. In most examples under consideration, a steady state is attained very quickly
(in figure 2.5 the steady state is already reached in period 17, where $T = 25$). Note that a
possible steady state $S_\infty^*$ is attained at the beginning of the planning horizon.


Remark 2.8. The steady state base-stock level $S_\infty^*$ is increasing in both the discount factor $\gamma$
and the backlogging costs $b$, since $F^{-1}(\cdot)$ is an increasing function and its argument in
(2.3.12) is increasing in these parameters. This is intuitive,
because the seller, by keeping higher inventory levels, wants to hedge against higher
backlogging costs, or reordering costs in a subsequent time period, respectively. Furthermore,
a higher demand uncertainty also results in higher safety stock levels and hence in higher
base-stock levels. Figure 2.6 gives an illustration of this relationship by varying one of the
parameters $h = 0.005$, $b = 0.4$, $\gamma = 1$, $E[D(\epsilon)] = 45$ and $\sigma = 40$ in each row (demand
is assumed to follow a normal distribution with mean $E[D(\epsilon)]$ and standard deviation $\sigma$).
Numerical results also show that for more heavy-tailed demand distributions (like the beta
or the log-normal distribution) base-stock levels are higher to prepare for the higher risk of
large demands.


Scarf (1960) and Veinott (1966) later extended the above theory to the case of nonzero fixed
ordering costs. They prove that the optimal total cost function is k-convex (under the
assumption of convex holding/shortage costs), implying that the optimal policy in each period
is an $(s, S)$-type policy: if the inventory level at the beginning of period $t$ is below the
reorder point $s_t$, an order is placed to raise the inventory level to the order-up-to level $S_t$.
Otherwise no order is placed. Since we are not focusing on the case of nonzero fixed ordering
costs in this thesis, we omit further details here. For the sake of completeness we note
that Zheng (1991) gives a simple proof of the optimality of an $(s, S)$-policy
for the infinite horizon case, which does not depend on results for the finite-horizon problem
(unlike an earlier proof by Iglehart (1963)).

Figure 2.6.: Steady state base-stocks under various parameters


2.3.3. Joint pricing and inventory control 


One of the first attempts to address joint marketing-production decision making was
presented by Whitin (1955), who used a deterministic model for a multi-period approach.
Thomas (1970) then extended the famous Wagner and Whitin (1958) forward algorithm
to the marketing-production domain where price is included as a decision variable (still in
a deterministic setting). In a subsequent paper, Thomas (1974) considers a stochastic
version of his model. There he considers the problem of jointly setting price and production
levels in a series of $T$ periods, where price is modeled as a parameter in the probability
distribution of demand. He is the first to formulate the problem as a dynamic program from
which an optimal policy is derived numerically. Following the work of Porteus (1982) and
Gallego and Van Ryzin (1994), Federgruen and Heching (1999) prove, assuming that the
underlying demand function is linear and that the ordering cost is proportional to the amount
ordered and thus does not include a fixed cost component, that a base-stock-list-price policy
is optimal. That is, in each period the optimal policy is characterized by an order-up-to level,
referred to as the base-stock, and a price which depends on the initial inventory level at the
beginning of the period. If the initial inventory level is below the base-stock level, an order
is placed to raise the inventory level to the base-stock level and the ordinary price (the list
price) is charged. Otherwise, no order is placed and a discount price is offered, which is a
non-increasing function of the initial inventory level.

Since in this section we focus on a joint optimization of inventory level $y_t$ and selling
price $p_t$ in each time period $t$, we again define demand additively as $D_t = E[D(p,\epsilon)] + \epsilon_t$,
whereby in the following expected demand is assumed to be a linear function
$E[D(p,\epsilon)] = \beta_0 + \beta_1 p$, with $\beta_0 > 0$ and $\beta_1 < 0$ (see section 2.2.2). As in subsection 2.3.2, for easier
tractability we consider the special case of zero fixed ordering costs ($k = 0$). We thus define
the underlying dynamic program as follows:


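$$ V_t(x) = \max_{y \geq x,\, p} \{ J_t(x, y, p) \}, \qquad (2.3.15) $$

$$ J_t(x, y, p) = pE[D(p,\epsilon)] - c(y - x) - G(y, p) + \gamma E[V_{t+1}(y - D(p,\epsilon))] \qquad (2.3.16) $$
with
$$ G(y, p) = h \int_{-\infty}^{y - E[D(p,\epsilon)]} (y - E[D(p,\epsilon)] - u) f(u)\,du + b \int_{y - E[D(p,\epsilon)]}^{\infty} (E[D(p,\epsilon)] + u - y) f(u)\,du \qquad (2.3.17) $$
and $V_{T+1}(x) = v \max(x, 0) + c \min(x, 0)$.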


Theorem 2.7. Let a multi-period inventory control model be given by the dynamic program
defined in equation (2.3.15), whereby $v = c$ is assumed and thus $V_{T+1}(x) = cx$. Then the
following holds for any time period $t = 1, \ldots, T$:


1. $J_t(x, y, p)$ is jointly concave in $x$, $y$ and $p$.
2. $V_t(x)$ is concave in $x$.
3. A base-stock policy with order-up-to level $S_t^*$ is optimal in time period $t$.


Proof. The proof technique is identical to the one used for theorem 2.5, except that this
time joint concavity in the two decision variables $y$ and $p$ is used to show the optimality of
a base-stock policy. When comparing theorem 2.7 to theorem 1 in Federgruen and Heching
(1999), note that there the optimal profit is reduced by the proportional costs of the stock
on hand, $cx$, and thus concavity in $x$ is trivially given, since then the value function before
maximization $J(y, p)$ no longer depends on $x$. □


We now turn our attention to the list-price property, which we define in the following: 


Definition 2.2 (Base-stock-list-price policy). A base-stock-list-price policy strongly relates
to a base-stock policy (see definition 2.1). If the initial inventory level is below the base-stock
level, an order is placed to raise the inventory level to the base-stock level and a so-called
list-price is charged. Otherwise, no order is placed and a discount price is offered, which
is a non-increasing function of the initial inventory (see figure 2.1 and 2.7 for a graphical
description of such a policy):

$$ y^*(x) = \begin{cases} S^* & \text{if } x < S^*, \\ x & \text{if } x \geq S^*, \end{cases} \qquad (2.3.18) $$

$$ p^*(x) = \begin{cases} P^* & \text{if } x < S^*, \\ p^*(x) & \text{if } x \geq S^*, \end{cases} \qquad (2.3.19) $$

where $p^*(x) \leq P^*$ and $p^*(x)$ is non-increasing in $x$.

Figure 2.7.: List-price policy (horizontal axis: inventory before ordering $x$)


Definition 2.3 (Submodularity). A submodular/subadditive function is a function $f(x, y)$
that has monotone decreasing differences, which means that for all $x^+ \geq x^- \in X$ and
$y^+ \geq y^- \in Y$ the following holds: $f(x^+, y^+) + f(x^-, y^-) \leq f(x^+, y^-) + f(x^-, y^+)$. As a
consequence, if $f(x, y)$ is differentiable, $f(x, y)$ is submodular whenever $\partial^2 f(x, y)/(\partial x \partial y) \leq 0$.
$f(x, y)$ is called supermodular/superadditive if $-f(x, y)$ is submodular.
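
The inequality in definition 2.3 can be checked numerically on a finite grid. The sketch below is an illustration only: the helper names, the grids and the choice of zero perturbation are assumptions, while the cost parameters follow the numerical example of this section. It applies the check to the realized holding/backlogging cost as a function of inventory after ordering and price.

```python
def is_submodular_on_grid(f, xs, ys, tol=1e-9):
    """Check f(x+, y+) + f(x-, y-) <= f(x+, y-) + f(x-, y+) for all grid pairs.

    xs and ys must be sorted in increasing order.
    """
    for i, x_lo in enumerate(xs):
        for x_hi in xs[i + 1:]:
            for j, y_lo in enumerate(ys):
                for y_hi in ys[j + 1:]:
                    lhs = f(x_hi, y_hi) + f(x_lo, y_lo)
                    rhs = f(x_hi, y_lo) + f(x_lo, y_hi)
                    if lhs > rhs + tol:
                        return False
    return True


def holding_backlog_cost(z, h=0.005, b=0.4):
    """Convex cost of ending a period with inventory level z."""
    return h * max(z, 0.0) + b * max(-z, 0.0)


def cost_after_demand(y, p, beta0=100.0, beta1=-20.0, eps=0.0):
    """Cost at inventory y - D(p, eps) with linear demand and a fixed perturbation."""
    return holding_backlog_cost(y - (beta0 + beta1 * p + eps))


if __name__ == "__main__":
    ys = [30.0, 40.0, 50.0, 60.0]
    ps = [2.0, 2.5, 3.0, 3.5]
    print(is_submodular_on_grid(cost_after_demand, ys, ps))
    print(is_submodular_on_grid(lambda y, p: -cost_after_demand(y, p), ys, ps))
```

On this grid the cost itself fails the check while its negative passes it, which matches the sign pattern expected for a convex function of $y - D(p, \epsilon)$ when demand is decreasing in price.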


Theorem 2.8. Let a multi-period inventory control model be given by the dynamic program
defined in equation (2.3.15) with $v = c$, then the optimal price $p_t^*(x)$ is non-increasing in
$x$ for any time period $t = 1, \ldots, T$.


Proof. To prove this theorem it suffices by theorem 8-4 in Heyman and Sobel (2004) to
show that $J_t(x, y, p)$ is submodular in $y$ and $p$ (see definition 2.3) for any time period
$t = 1, \ldots, T$.


Since the sum of submodular functions is submodular, we need to show submodularity
of each of the terms in (2.3.16). The first and second terms are trivially submodular since
they depend only on one of the two variables $y$ and $p$. In order to show that $-G(y, p)$ has
monotone decreasing differences we define $H(x)$ to be the actual holding/backlogging costs at
inventory level $x$ after demand, such that $E[H(y - D(p,\epsilon))] = G(y, p)$. Note that since
$b, h > 0$ the function $H(x)$ is convex. To show that $H(y - D(p,\epsilon))$ is supermodular in $y$
and $p$ we consider an arbitrary pair of inventory levels $(y^-, y^+)$ and any pair of price levels
$(p^-, p^+)$ with $y^- < y^+$ and $p^- < p^+$. We furthermore define

$$ z^{--} = y^- - D(p^-,\epsilon), \quad z^{-+} = y^- - D(p^+,\epsilon), \quad z^{+-} = y^+ - D(p^-,\epsilon), \quad z^{++} = y^+ - D(p^+,\epsilon). \qquad (2.3.20) $$

From $\beta_1 < 0$ it follows naturally that $z^{-+} > z^{--}$. By convexity of $H(\cdot)$ we have:

$$ H(z^{++}) - H(z^{-+}) = H(z^{-+} + y^+ - y^-) - H(z^{-+}) \geq H(z^{--} + y^+ - y^-) - H(z^{--}) = H(z^{+-}) - H(z^{--}). \qquad (2.3.21) $$

Thus by definition 2.3, $H(y - D(p,\epsilon))$ is supermodular in $y$ and $p$.
Since taking expected values preserves this property, $G(y, p)$ is supermodular and hence $-G(y, p)$
is submodular. Finally, the submodularity proof for the last term in (2.3.16) is identical to the
one for $-G(y, p)$, as $V_{t+1}(x)$ is concave in $x$ by theorem 2.7. □


Remark 2.9. The optimality of a list-price policy can be motivated by the intuition that hold- 
ing costs of unnecessarily high inventory levels x can be reduced by accelerating demand 
via reducing the selling price p. 


Similar to theorem 2.6, we can find a possible steady state for the joint pricing and inven- 
tory control model (2.3.15). 


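Theorem 2.9. If the dynamic program (2.3.15) admits a steady state, then it is given by

$$ p_\infty^* = -\frac{\beta_0 - \beta_1 c}{2\beta_1}, \qquad (2.3.22) $$
$$ S_\infty^*(p_\infty^*) = E[D(p_\infty^*,\epsilon)] + F^{-1}\left(\frac{b - (1-\gamma)c}{h + b}\right). \qquad (2.3.23) $$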


Proof. The optimal total discounted profit for an infinite time horizon is given by (compare
equation (2.3.1)):

$$ V(x_1) = \sum_{t=0}^{\infty} \gamma^{t} \max_{y_{t+1} \geq x_{t+1},\, p_{t+1}} \{ p_{t+1} E[D(p_{t+1},\epsilon_{t+1})] - c(y_{t+1} - x_{t+1}) - G(y_{t+1}, p_{t+1}) \}. \qquad (2.3.24) $$
Again, we rearrange the above infinite sum, replace the time-variant variables $y_t$ and $p_t$ by
their time-invariant counterparts $y$ and $p$, and obtain

$$ V(x_1) = cx_1 + \sum_{t=0}^{\infty} \gamma^{t} \big[ pE[D(p,\epsilon)] - c(y - \gamma y + \gamma D(p,\epsilon)) - G(y, p) \big]. \qquad (2.3.25) $$

Since the profit function $V(x_1)$ is concave in $y$ (see theorem 2.6), the optimization problem
in the two variables $p$ and $y$ can be reduced to an optimization problem in the single variable
$p$. This can be done by first solving for the optimal value of $y$ as a function of $p$ and then
substituting the result $y^*(p)$ back into $V(x_1)$ and solving in $p$. $y^*(p)$ can be obtained in
the same manner as in theorem 2.6, with the only difference that now $p$ also is a decision
variable. For convenience we define $y^0 = F^{-1}\left(\frac{b - (1-\gamma)c}{h + b}\right)$. We now use the optimal inventory
after ordering $y^*(p) = E[D(p,\epsilon)] + y^0$ (compare theorem 2.6) as an input for $y$ in equation
(2.3.25). This yields

$$ V(x_1) = cx_1 + \sum_{t=0}^{\infty} \gamma^{t} \big[ pE[D(p,\epsilon)] - c(E[D(p,\epsilon)] + y^0) - G(E[D(p,\epsilon)] + y^0, p) \big]. \qquad (2.3.26) $$
From equation (2.3.17) we have
$G(y^*(p), p) = h \int_{-\infty}^{y^0} (y^0 - u) f(u)\,du + b \int_{y^0}^{\infty} (u - y^0) f(u)\,du$,
which is a constant and no longer depends on $p$. As $V(x_1)$ is concave in $p$, it suffices to
solve the first-order condition in $p$:

$$ E[D(p,\epsilon)] + \frac{\partial E[D(p,\epsilon)]}{\partial p}(p - c) = 0. \qquad (2.3.27) $$
In the case of the linear demand function $D(p,\epsilon) = \beta_0 + \beta_1 p + \epsilon$, equation (2.3.22) follows
directly. □

Figure 2.8.: Base-stock and list-price evolution over time


Remark 2.10. Note that the optimal steady state price (2.3.22) equals the optimal riskless 
price (2.2.16) from section 2.2.2. 
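
As a quick numerical cross-check of Remark 2.10, the closed form (2.3.22) can be compared with a brute-force maximization of the riskless per-period profit $(p - c)E[D(p,\epsilon)]$. This is only a sketch: the grid search helper and its bounds are assumptions made for illustration, while the demand and cost parameters are those of the numerical example in this section.

```python
def steady_state_price(beta0, beta1, c):
    """Closed-form steady-state price (2.3.22), identical to the riskless price (2.2.16)."""
    return -(beta0 - beta1 * c) / (2.0 * beta1)


def riskless_profit(p, beta0, beta1, c):
    """Per-period riskless profit (p - c) * E[D(p)] with E[D(p)] = beta0 + beta1 * p."""
    return (p - c) * (beta0 + beta1 * p)


def grid_argmax(f, lo, hi, steps):
    """Return the grid point in [lo, hi] with the largest value of f (coarse search)."""
    best_p, best_val = lo, f(lo)
    for i in range(1, steps + 1):
        p = lo + (hi - lo) * i / steps
        val = f(p)
        if val > best_val:
            best_p, best_val = p, val
    return best_p


if __name__ == "__main__":
    beta0, beta1, c = 100.0, -20.0, 0.5
    closed_form = steady_state_price(beta0, beta1, c)
    numeric = grid_argmax(lambda p: riskless_profit(p, beta0, beta1, c), 0.5, 5.0, 450)
    print(closed_form, numeric, beta0 + beta1 * closed_form)
```

For these parameters the expected demand at the steady-state price coincides with the mean demand used in the inventory-only illustrations of section 2.3.2.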


A sample evolution of the base-stock and list-price over time for $\beta_0 = 100$, $\beta_1 = -20$,
$c = 0.5$, $b = 0.4$, $h = 0.005$, $v = 0$ and $T = 15$ is depicted in figure 2.8. One can see that while
the list-price stays constant over time (or is non-decreasing for other parameter settings),
the tendency to lower inventory increases over time in the case of salvage value $v = 0$.
Furthermore, it can be observed that at the beginning of the time horizon a base-stock
greater than the expected demand is kept in order to hedge against expensive backlogging
costs. Towards the end of the time horizon, the risk of not selling inventories
on stock increases, which motivates decreasing base-stock levels. Note that in the last time period a
negative safety stock $S_T^* - E[D(p_T,\epsilon_T)] < 0$ is observed.

Figure 2.9.: Price trajectories for different time periods


Figure 2.9 shows the optimal price $p^*(x_t)$ as a function of the inventory before ordering $x_t$ for
different time periods and the above parameter setting. It is easy to see that the optimal policy is a
list-price policy in any time period $t$. Furthermore, it can be seen that the tendency to give price
discounts at lower inventory levels before ordering increases over time. This is intuitive
since, as described above, the model aims at reducing inventory levels towards the end of
the time horizon, and a lower price for higher inventory levels results in a higher expected
demand, which in turn results in lower inventory levels after demand is realized.


Chen and Simchi-Levi (20042) later extend the above described results for more general 
demand functions and the case of nonzero fixed ordering costs. They prove that the optimal 
profit function is k -concave (symmetric k-concave) and find that ( s, S, p)-policy is opti- 
mal. In such a policy the inventory is managed by the classical ( s, S )-policy and price is 
determined based on the inventory level at the beginning of each period. In a different paper, 
Chen and Simchi-Levi (2004b) investigate the infinite horizon problem. Since we are neither 
focusing on the case of nonzero fixed ordering costs nor on the infinite horizon in this thesis, 
we omit further details on those contributions here. For the convenience of the reader we 
give a brief overview of the notation used in this section in table 2.2. 

Variable | Characteristics | Description 
$T$ | $T \geq 1$ | length of the planning horizon 
$t$ | $T \geq t \geq 1$ | time period 
$D_t(p_t, \epsilon_t)$ | | unit(s) demanded in period t (random variable) 
$\epsilon_t$ | | i.i.d. perturbation of demand in period t (random variable) under additive demand uncertainty with mean 0 
$F(\cdot)$ | $1 \geq F(u) \geq 0$ | cdf of demand perturbation 
$f(\cdot)$ | $F(y) = \int_{-\infty}^{y} f(u)\,du$ | probability density function of demand perturbation 
$p_t$ | | per unit sales price in period t 
$y_t$ | | inventory level after ordering in period t 
$x_t$ | | inventory level before ordering 
$c$ | | per unit production/ordering cost 
$k$ | | fixed ordering (setup) cost 
$v$ | | per unit salvage value (or salvage cost if $v < 0$) 
$\gamma$ | $1 \geq \gamma \geq 0$ | discount factor 
$b$ | $b > (1-\gamma)c$ | per unit backlog penalty costs 
$h$ | $h \geq 0$ | per unit holding costs 
$v_t(x_t)$ | | expected optimal profit-to-go function 

Table 2.2.: Notation: Multi-period models 
3. Models in Marketing Literature 


3.1. Problem description 


Research in marketing demonstrates that in markets with repeated interactions, demand not 
only depends on the current price but is also sensitive to the firm’s pricing history and thus 
accounts for intertemporal demand correlations. The aim of these approaches is to assess 
optimal prices with respect to maximizing total expected profit, taking demand fulfillment for 
granted. Since consumers have a memory, their perception of a price is based not only on its 
absolute level but also on its deviation from some reference level resulting from the pricing history. 
As customers revisit the firm, they develop price expectations, which become a benchmark 
against which current prices are compared. A formulation that captures this effect is the so- 
called reference price, which is a standard price against which consumers evaluate the actual 
prices of products they are considering (see e.g. Winer (1986), Greenleaf (1995), Kopalle 
et al. (1996), Briesch et al. (1997), Fibich et al. (2003), Mazumdar et al. (2005), Natter et al. 
(2006)). If the price is below the reference price, the observed price is lower than anticipated, 
resulting in a perceived gain. This would make a purchase more attractive and raise demand. 
Similarly, the opposite situation would result in a perceived loss, reducing the probability of 
a purchase (people are less likely to buy products after prices have gone up). An important 
consequence of this reference price formation is that although frequent price discounts may 
be beneficial in the short run, they may be dangerous in the long run when consumers get 
used to these discounts and reference prices drop. The reduced price becomes anticipated and 
loses its effectiveness, whereas the non-promoted price becomes unanticipated and would be 
perceived as a loss. Thus, the optimal price policy becomes dynamic, with the reference 
price being the state variable. For example, Popescu and Wu (2007) show that if the reference level is 
initially high, an optimizing firm will often consistently price below this level, which has the 
effect of a skimming strategy. Similarly, a low initial reference level leads to the optimality 
of a penetration type strategy. The reference price effect can be integrated into the demand 
model by modeling the reference price as the weighted sum of the previous reference price 
and the previous price set (exponential smoothing) and by adding an additional term to the 
response function where a positive reference gap (current price is lower than the reference 
price) increases demand, while a negative reference gap decreases demand (see equation 
(3.2.1)). 
3.2. Model formulation and dynamic program 


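Variable | Characteristics | Description 
$T$ | $T \geq 1$ | length of the planning horizon 
$t$ | $T \geq t \geq 1$ | time period 
$D_t(p_t, r_t, \epsilon_t)$ | | unit(s) demanded in period t (random variable) 
$\epsilon_t$ | $E[\epsilon_t] = 0$ | i.i.d. perturbation of demand in period t with mean 0 
$p_t$ | $\underline{p} \leq p_t \leq \bar{p}$ | per unit sales price in period t 
$r_t$ | $\underline{r} \leq r_t \leq \bar{r}$ | per unit reference price in period t 
$c$ | | per unit cost 
$\gamma$ | $\gamma \in [0, 1)$ | discount factor 
$\alpha$ | $0 \leq \alpha < 1$ | memory parameter 
$V_t(r_t)$ | | expected optimal profit-to-go function 


Table 3.1.: Notation: Marketing models 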


The above described marketing model aims at setting optimal prices without considering 
inventory decisions by taking demand fulfillment for granted. As in chapter 2, we consider 
a finite horizon, stochastic, single item and periodic review model under a monopolistic 
setting. Demand perturbations in consecutive periods are independent and their distribution 
depends on the item's price and the consumer's reference price, which is based on the pricing 
history. Including reference price effects, we redefine the additive demand function used in 
chapter 2 in the following way: 


Definition 3.1. The stochastic demand is modeled by the piecewise linear function 

$$D_t(p_t, r_t, \epsilon_t) = \beta_0 + \beta_1 p_t + \beta_2 \max\{p_t - r_t, 0\} + \beta_3 \min\{p_t - r_t, 0\} + \epsilon_t, \qquad (3.2.1)$$ 

where $r_t$ denotes the reference price in period t, $\epsilon_t$ is i.i.d. according to an arbitrary 
probability density function $f(\cdot)$ with mean $E[\epsilon_t] = 0$, and $\beta_0 > 0$ and 
$\beta_1, \beta_2, \beta_3 < 0$. Price $p_t$ and reference price $r_t$ are restricted to arbitrary finite 
intervals $[\underline{p}, \bar{p}]$ and $[\underline{r}, \bar{r}]$ such that 
$E[D(p_t, r_t, \epsilon_t)] \geq 0$ and $\underline{p} \geq c$. 


Remark 3.1. Note that $\beta_0 > 0$ and $\beta_1, \beta_2, \beta_3 < 0$ ensure that the demand function 
is decreasing in price and increasing in reference price. Moreover, $\underline{p} \leq p_t \leq \bar{p}$ 
and $\underline{r} \leq r_t \leq \bar{r}$ guarantee that the expected demand is non-negative and finite. 
Furthermore, we create a monopolistic framework by restricting the price to an interval with an upper 
and a lower bound, $p_t \in [\underline{p}, \bar{p}]$. 
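
The expected value of the demand function in definition 3.1 can be sketched in a few lines of Python. This is only an illustration: the function name `expected_demand` is hypothetical, and the parameter values are those of the base scenario used in the numerical examples of this chapter.

```python
def expected_demand(p, r, beta0, beta1, beta2, beta3):
    """Expected demand E[D(p, r, eps)] from (3.2.1); the perturbation eps has mean 0."""
    loss = max(p - r, 0.0)  # price above reference price: perceived loss
    gain = min(p - r, 0.0)  # price below reference price: perceived gain (non-positive)
    return beta0 + beta1 * p + beta2 * loss + beta3 * gain


# Base scenario of the numerical examples (loss-neutral: beta2 = beta3).
params = dict(beta0=100.0, beta1=-20.0, beta2=-40.0, beta3=-40.0)

d_equal = expected_demand(4.3, 4.3, **params)      # price equals reference price
d_discount = expected_demand(4.3, 4.4, **params)   # price below reference price
d_surcharge = expected_demand(4.3, 4.2, **params)  # price above reference price
```

With these values a price 0.1 below the reference price raises expected demand by 4 units, and a price 0.1 above it lowers expected demand by the same amount, as expected in the loss-neutral case.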
Figure 3.1.: Prospect theory 


If equation (3.2.1) is symmetric with respect to the effect of gains and losses ($\beta_2 = \beta_3$), 
buyers are loss-neutral and the demand function is smooth. For loss-averse consumers, the 
demand function is steeper for losses than for gains ($\beta_2 < \beta_3$) and consumers respond more 
to surcharges than to discounts. In other words, a loss decreases expected demand more than 
an equivalently sized gain would increase demand (see figure 3.1). This behavior is predicted 
by Prospect Theory (see e.g. Winer (1986)). If $\beta_2 > \beta_3$, consumers are loss-seeking. As 
Slonim and Garbarino (2002) show, $\beta_2 > \beta_3$ can also arise on the aggregate level when in 
fact the consumers behave according to Prospect Theory but stockpile when prices are low. 
We will focus on the loss-neutral and loss-averse cases, which yield closed-form steady 
state solutions, while the optimal pricing policy in the loss-seeking case cycles (see Popescu 
and Wu (2007) and figure 3.8). 


The reference price $r_t$ in equation (3.2.1) is given by some updating mechanism based on 
past prices such that recent occasions have greater effects than more distant ones and a higher 
previous price results in a higher current reference price. In the literature we observe several 
ways a reference price can be formed. One, introduced by Krishnamurthi et al. (1992), is to 
operationalize the reference price as the one-period lagged price for a brand: $r_t = p_{t-1}$. Another 
way could be summing past prices (see e.g. Winer (1986)). Exponential smoothing (introduced 
in the adaptive expectations framework by Nerlove (1985)) is the most commonly used 
and empirically validated reference price mechanism in the literature (see e.g. Winer (1986), 
Greenleaf (1995), Kopalle et al. (1996), Fibich et al. (2003), Popescu and Wu (2007)): 


Definition 3.2. Let $p_t$ denote the observed selling price and $r_t$ the reference price for a 
specific brand in period t, then a reference price updating mechanism is given by 

$$r_{t+1} = \alpha r_t + (1 - \alpha) p_t, \qquad (3.2.2)$$ 

where $0 \leq \alpha < 1$ denotes the memory parameter and captures how strongly the reference 
price depends on past prices (see figure 3.2). 


Figure 3.2.: Formation of reference price (price $p_t$ and reference price $r_t$ for $\alpha = 0.5$, plotted over time period t) 


Remark 3.2. Note that lower values of $\alpha$ represent a shorter term memory. In particular, if 
$\alpha = 0$, the reference price is the one-period lagged price ($r_t = p_{t-1}$) as in Krishnamurthi 
et al. (1992). 
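
The updating mechanism (3.2.2) is straightforward to trace numerically. The following sketch uses hypothetical helper names (`update_reference_price`, `reference_price_path`) and illustrative prices; it is not taken from the original text.

```python
def update_reference_price(r, p, alpha):
    """One step of exponential smoothing, equation (3.2.2)."""
    return alpha * r + (1.0 - alpha) * p


def reference_price_path(r1, prices, alpha):
    """Reference prices r_1, r_2, ... induced by a given sequence of prices."""
    path = [r1]
    for p in prices:
        path.append(update_reference_price(path[-1], p, alpha))
    return path


# Illustrative values: reference price 4.0, then two periods at price 5.0.
path = reference_price_path(4.0, [5.0, 5.0], alpha=0.5)
lagged = reference_price_path(4.0, [5.0, 5.0], alpha=0.0)  # one-period lagged price
```

For a constant price the reference price approaches that price geometrically at rate $\alpha$; with $\alpha = 0$ it jumps to the last price immediately, which is the case noted in remark 3.2.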


The memory parameter $\alpha$ used in equation (3.2.2) is estimated in such a way that the 
highest possible $R^2$ (quantifying the goodness of fit) for equation (3.2.1) is obtained in 
ordinary least squares regression. Statistical parameter estimation is beyond the scope of 
this thesis, but there is considerable literature on empirical studies (e.g. Greenleaf (1995), 
Tellis (1988), Bijmolt et al. (2005), Ho and Zhang (2004), Popescu and Wu (2007), Natter 
et al. (2006)) finding that a demand model like equation (3.2.1) fits empirical data very 
well and giving estimated parameter values based on time-series data. For example, 
Greenleaf (1995) and Kopalle et al. (1996) find that estimated values of $\alpha$ 
lie in the range [0, 0.925]. For a more detailed exposition of reference price mechanisms, see 
e.g. Kopalle and Lindsey-Mullikin (2003) and Moon et al. (2006). 


Similar to chapter 2, pricing decisions $p_t$, where $\underline{p} \leq p_t \leq \bar{p}$, are made at the 
beginning of each period t with the objective of maximizing total expected discounted profit over the 
entire planning horizon T. In marketing models, demand fulfillment is taken for granted and 
thus no inventory decisions are considered. Costs c in each time period t are again assumed 
to be time invariant. Therefore the maximum total expected discounted profit $V(r_1)$, when 
the initial reference price at the beginning of the planning horizon is given by $r_1$ and cash 
flows occurring in subsequent time periods are discounted by a one-period discount factor 
$\gamma \in [0, 1)$, is given as follows: 

$$V(r_1) = \max_{p_t \in [\underline{p}, \bar{p}]} \sum_{t=1}^{T} \gamma^{t-1} (p_t - c) E[D(p_t, r_t, \epsilon_t)]. \qquad (3.2.3)$$ 


Similar to section 2.3, the decision which price to charge in each period is made in stages 
and cannot be viewed in isolation. Here again, the desire for high present profits, obtained 
by charging relatively low prices, must be balanced against the undesirability of low future 
profits, resulting from the formation of a low reference price as a consequence of the earlier 
price discounts. As the reference price summarizes past information which is relevant for 
future optimization, dynamic programming, which was introduced in section 2.3.1, is an 
appropriate technique for solving this problem. We thus rewrite equation (3.2.3) in terms of 
the Bellman equation: 


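$$V_t(r_t) = \max_{p_t \in [\underline{p}, \bar{p}]} \left\{ E[(p_t - c) D(p_t, r_t, \epsilon_t)] + \gamma E[V_{t+1}(\alpha r_t + (1 - \alpha) p_t)] \right\} \qquad (3.2.4)$$ 


with $V_{T+1} = 0$. Note that state variables $r_t$ and decision variables $p_t$ are related via 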
equation (3.2.2). For the convenience of the reader, the notation used in this chapter is 
briefly summarized in table 3.1 at the beginning of this section. 
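
The backward recursion (3.2.4) can be sketched numerically as follows. This is a minimal illustration, not the implementation used for the figures in this chapter: the function name `solve_pricing_dp`, the common grid for prices and reference prices, and the linear interpolation of the value function are all assumptions made here. Because the perturbation has mean zero and does not enter the transition (3.2.2), only expected demand is needed.

```python
import numpy as np


def solve_pricing_dp(T, beta0, beta1, beta2, beta3, alpha, gamma, c,
                     p_lo, p_hi, n_grid=201):
    """Backward induction for (3.2.4) on a grid (illustrative discretization)."""
    grid = np.linspace(p_lo, p_hi, n_grid)  # shared grid for r_t and p_t
    r = grid[:, None]                       # states along the rows
    p = grid[None, :]                       # decisions along the columns
    gap = p - r
    demand = (beta0 + beta1 * p
              + beta2 * np.maximum(gap, 0.0)
              + beta3 * np.minimum(gap, 0.0))
    profit = (p - c) * demand               # one-period expected profit
    r_next = alpha * r + (1.0 - alpha) * p  # transition (3.2.2)

    V_next = np.zeros(n_grid)               # terminal condition V_{T+1} = 0
    policy = np.empty((T, n_grid))
    rows = np.arange(n_grid)
    for t in range(T - 1, -1, -1):
        # value of the next period, interpolated at the next reference price
        cont = np.interp(r_next.ravel(), grid, V_next).reshape(r_next.shape)
        total = profit + gamma * cont
        best = np.argmax(total, axis=1)
        policy[t] = grid[best]              # optimal price for each reference price
        V_next = total[rows, best]
    return grid, policy, V_next


# Base scenario of this chapter with T = 40 and prices restricted to [4.2, 4.4].
grid, policy, V1 = solve_pricing_dp(T=40, beta0=100.0, beta1=-20.0, beta2=-40.0,
                                    beta3=-40.0, alpha=0.5, gamma=0.5, c=4.0,
                                    p_lo=4.2, p_hi=4.4)
i = int(np.argmin(np.abs(grid - 4.3)))      # grid index closest to r = 4.3
```

Here `policy[t, i]` approximates the optimal price in period t+1 when the reference price is `grid[i]`; a finer grid reduces the discretization error at the cost of run time.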


3.3. Results 


3.3.1. Loss-neutral customer behavior 


An important consequence of the reference price formation described above is that although 
frequent price discounts may be beneficial in the short run, they may be dangerous in the long 
run when consumers get used to these discounts and reference prices drop. The reduced price 
becomes anticipated and loses its effectiveness, whereas the non-promoted price becomes 
unanticipated and would be perceived as a loss. As in section 2.3, we are interested in a 
possible steady state. Convergence analysis leads to the following theorem: 


Theorem 3.1. If the dynamic program (3.2.4) admits a steady state, then for the linear 
demand function defined in equation (3.2.1) and the loss-neutral case ($\beta_2 = \beta_3$) it is given 
by: 

$$p^*_{ss} = \frac{(\beta_1 c - \beta_0)(1 - \alpha\gamma) + \beta_2 (1 - \gamma) c}{2\beta_1 (1 - \alpha\gamma) + \beta_2 (1 - \gamma)}. \qquad (3.3.1)$$ 


Proof. The optimal total expected discounted profit for an infinite time horizon is given by 
(compare (3.2.3)): 

$$V(r_1) = \sum_{t=0}^{\infty} \gamma^t \max_{p_{t+1}} \left[ (p_{t+1} - c) E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})] \right], \qquad (3.3.2)$$ 

where $r_1$ denotes the starting reference price in the current period. Substituting the transition 
function (3.2.2), as we did when proving theorems 2.6 and 2.9, is not sufficient here since 
$r_{t+1} = \alpha r_t + (1 - \alpha) p_t$ still depends on the reference price $r_t$. Thus we substitute 
$p_t = p$ and express the reference price $r_{t+1}$ in terms of the starting reference price $r_1$ and p: 

$$r_{t+1} = \alpha^t (r_1 - p) + p. \qquad (3.3.3)$$ 

This yields 

$$V(r_1) = \sum_{t=0}^{\infty} \gamma^t \left[ (p - c) E[D(p, \alpha^t (r_1 - p) + p, \epsilon)] \right], \qquad (3.3.4)$$ 

which for the linear demand and loss-neutral case becomes 

$$V(r_1) = \sum_{t=0}^{\infty} \gamma^t \left[ (p - c)(\beta_0 + \beta_1 p - \alpha^t \beta_2 (r_1 - p)) \right]. \qquad (3.3.5)$$ 

Differentiating $V(r_1)$ with respect to p and setting equal to 0 yields 

$$\frac{2\beta_1}{1 - \gamma} p + \frac{\beta_2}{1 - \alpha\gamma} (2p - r_1) + \frac{\beta_0 - \beta_1 c}{1 - \gamma} - \frac{\beta_2 c}{1 - \alpha\gamma} = 0. \qquad (3.3.6)$$ 

Using the assumption that the reference price is in a steady state r, we set $r_t = r$ for all t. 
From equation (3.2.2) it is clear that $r_1 = r = p$. Since $V(r_1)$ is concave, solving equation 
(3.3.6) yields the steady state price (3.3.1) for the loss-neutral case. □ 
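
As a numerical cross-check of the derivation, the closed form (3.3.1) can be evaluated and substituted back into the first-order condition (3.3.6) with $r_1 = p$. The function names below are hypothetical, and the parameter values are the base scenario used in the numerical examples of this chapter.

```python
def steady_state_price(beta0, beta1, beta2, alpha, gamma, c):
    """Loss-neutral steady state price, equation (3.3.1)."""
    num = (beta1 * c - beta0) * (1 - alpha * gamma) + beta2 * (1 - gamma) * c
    den = 2 * beta1 * (1 - alpha * gamma) + beta2 * (1 - gamma)
    return num / den


def foc_residual(p, r1, beta0, beta1, beta2, alpha, gamma, c):
    """Left-hand side of the first-order condition (3.3.6)."""
    return ((2 * beta1 * p + beta0 - beta1 * c) / (1 - gamma)
            + beta2 * (2 * p - r1 - c) / (1 - alpha * gamma))


base = dict(beta0=100.0, beta1=-20.0, beta2=-40.0, alpha=0.5, gamma=0.5, c=4.0)
p_ss = steady_state_price(**base)
residual = foc_residual(p_ss, p_ss, **base)  # steady state: r_1 = p
```

For the base scenario this gives a steady state price of 4.3, and the residual of (3.3.6) vanishes up to rounding error.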


For non-differentiable demand functions and non-negative ordering costs the proof of the- 
orem 3.1 needs to be adjusted to a variational approach used in (Popescu and Wu 2007, Proof 
of theorem 1). For consistency we give a short sketch of the proof in the following: 


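Theorem 3.2. If the dynamic program (3.2.4) admits a steady state, then for loss-neutral 
customer behavior ($\beta_2 = \beta_3$) it is given by: 

$$\frac{\partial \left( (p - c) E[D(p, p, \epsilon)] \right)}{\partial p} = \frac{(p - c)(1 - \gamma)}{1 - \alpha\gamma} \cdot \frac{\partial \left( E[D(p, r, \epsilon)] - E[D(p, p, \epsilon)] \right)}{\partial (r - p)}. \qquad (3.3.7)$$ 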


Proof. The proof of (Popescu and Wu 2007, theorem 1) is adjusted in such a way that the 
one-period expected profit $\Pi(p, r)$ is given by $\Pi(p, r) = \pi(p) + \Pi^*(p, r)$, where now 
$\pi(p) = (p - c) E[D(p, p, \epsilon)]$ denotes the expected profit in a market where consumers do not 
form reference effects and $\Pi^*(p, r) = (p - c) R(r - p, r)$ denotes the expected reference 
profit, with $R(r - p, r)$ being the reference effect $R(r - p, r) = E[D(p, r, \epsilon)] - E[D(p, p, \epsilon)]$. 
Hence the value function under policy $p(\delta)$ becomes 

$$V_\delta(r) = \pi(r - \delta) + (r - \delta - c) R(\delta, r) + \gamma \pi(r + \alpha\delta) + \gamma (r + \alpha\delta - c) R(-\delta, r - (1 - \alpha)\delta) + \gamma^2 V(r). \qquad (3.3.8)$$ 

Subtracting $V(r)$ from $V_\delta(r)$ yields 

$$V_\delta(r) - V(r) = \pi(r - \delta) - \pi(r) + \gamma (\pi(r + \alpha\delta) - \pi(r)) + (r - \delta - c) R(\delta, r) + \gamma (r + \alpha\delta - c) R(-\delta, r - (1 - \alpha)\delta) \leq 0. \qquad (3.3.9)$$ 

Dividing by $\delta$, and letting $\delta$ go to zero, we rearrange terms to obtain 

$$(1 - \alpha\gamma) \pi'(r) \geq (r - c)(1 - \gamma) \lambda(r), \qquad (3.3.10)$$ 

where $\lambda(r) = R_x(0, r)$. The same argument for the feasible policy $p(-\delta)$ yields the 
opposite inequality. Combining those two inequalities, an interior steady state must solve 

$$(1 - \alpha\gamma) \pi'(p) = (p - c)(1 - \gamma) \lambda(p), \qquad (3.3.11)$$ 

which by substituting the above definitions for $\pi(p)$ and $\lambda(p)$ results in (3.3.7). □ 


Figure 3.3.: Optimal price and reference price path 


Popescu and Wu (2007) show (in theorem 2 and lemma 4) that if the system admits a 
steady state, the optimal price path converges monotonically (under certain assumptions) 
to a constant steady state. They give this steady state only for the case of zero ordering costs; 
the above theorem extends it to the case of $c > 0$. Furthermore, Kopalle et al. (1996) 
and Popescu and Wu (2007) show for time invariant parameters that if the reference level is 
initially high, an optimizing firm should consistently price below this level, which has the 
effect of a skimming strategy. Similarly, a low initial reference level leads to the optimality 
of a penetration type strategy. A numerical example for $\beta_0 = 100$, $\beta_1 = -20$, 
$\beta_2 = \beta_3 = -40$, $\alpha = 0.5$, $\gamma = 0.5$ and $c = 4$ is given in figure 3.3, where prices 
and reference prices are restricted to the interval $p_t, r_t \in [4.2, 4.4]$ to ensure nonnegative 
demand for any combination of $p_t$ and $r_t$. 


Remark 3.3. Note that the numerical example in figure 3.3 was obtained for a finite planning 
horizon T = 40. When considering a finite planning horizon, it is observed that prices are 
lowered towards the end of the horizon in order to benefit from reference effects. Keeping 
consumers' price expectations high is not reasonable when the end of a product's life cycle 
is reached and the remaining future selling periods do not suffice to outweigh the loss of 
profit a price promotion would induce in the promotion period. In this chapter we mainly 
focus on a steady state analysis (according to Popescu and Wu (2007)) and use a planning 
horizon T = 40 for numerical calculations, but only plot the results for a period of T = 25 
in order to eliminate the transient behavior at the end of the horizon and thus simulate an 
infinite horizon behavior. 


In the following we give some upper and lower bounds for the optimal steady state price 
$p^*_{ss}$ and a sensitivity analysis with respect to its parameters $\alpha$, $\gamma$ and $\beta_2$. 


Theorem 3.3. The optimal steady state price $p^*_{ss}$ is decreasing in the memory parameter $\alpha$, 
increasing in the discount factor $\gamma$ and decreasing in the reference effect $|\beta_2|$. Furthermore, 
it satisfies 

$$p^*_{m} \leq p^*_{ss} \leq p^*_{0}, \qquad (3.3.12)$$ 

where $p^*_{m}$ denotes the myopic, one period profit optimizing price ($\gamma = 0$) and $p^*_{0}$ denotes 
the optimal price in the absence of reference price ($\beta_2 = 0$). 


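Proof. Since by definition 3.1 prices p are assumed to be greater than costs c and expected 
demand is assumed to be non-negative, it is clear that $\beta_0 + \beta_1 c > 0$. We furthermore know 
that the expected demand is increasing in reference price ($\beta_2 < 0$) and the discount factor 
$\gamma$ is bounded by [0, 1). By differentiating $p^*_{ss}$ with respect to $\alpha$, $\gamma$ and $\beta_2$, 
we thus obtain 

$$\frac{\partial p^*_{ss}}{\partial \alpha} = \frac{\gamma (1 - \gamma) \beta_2 (\beta_0 + \beta_1 c)}{(2\beta_1 (1 - \alpha\gamma) + \beta_2 (1 - \gamma))^2} \leq 0, \qquad (3.3.13)$$ 

$$\frac{\partial p^*_{ss}}{\partial \gamma} = \frac{-(1 - \alpha) \beta_2 (\beta_0 + \beta_1 c)}{(2\beta_1 (1 - \alpha\gamma) + \beta_2 (1 - \gamma))^2} \geq 0, \qquad (3.3.14)$$ 

$$\frac{\partial p^*_{ss}}{\partial \beta_2} = \frac{(1 - \gamma)(1 - \alpha\gamma)(\beta_0 + \beta_1 c)}{(2\beta_1 (1 - \alpha\gamma) + \beta_2 (1 - \gamma))^2} \geq 0, \qquad (3.3.15)$$ 

which shows that the optimal steady state price $p^*_{ss}$ is decreasing in $\alpha$, increasing in $\gamma$ and 
decreasing in $|\beta_2|$. Since $p^*_{ss}$ increases in $\gamma$, the first part of equation (3.3.12), 
$p^*_{m} \leq p^*_{ss}$, is clearly satisfied. Analogously, $p^*_{ss} \leq p^*_{0}$ holds because $p^*_{ss}$ is 
decreasing in the reference effect $|\beta_2|$. □ 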


Remark 3.4. Note that $p^*_{0}$ equals the optimal steady state price given in equation (2.3.22) 
of section 2.3.3. 
Figure 3.4.: Optimal and myopic pricing prices 


Remark 3.5. The inequality $p^*_{m} \leq p^*_{ss}$ from the above theorem states that the prices charged 
by a myopic firm ignore their eroding effect on future demand, and hence on future profits, 
and thus are less than or equal to the optimal price. By charging higher prices, current 
profits are traded off for future long-term profitability from higher reference prices. Furthermore, 
$p^*_{ss} \leq p^*_{0}$ shows that the optimal price in the absence of a reference price is always 
greater than or equal to the optimal price obtained from the model including reference effects. 
This means that in the long run, strategic firms should charge lower prices 
when consumers form reference effects than when they do not. 


Figure 3.4 shows for our base scenario ($\beta_0 = 100$, $\beta_1 = -20$, $\beta_2 = \beta_3 = -40$, 
$\alpha = 0.5$, $\gamma = 0.5$ and $c = 4$) how the optimal prices and the myopic prices evolve over 
time until they reach their steady states $p^*_{ss} = 4.3$ and $p^*_{m} = 4.25$. Figure 3.5 numerically 
illustrates the results of theorem 3.3 and shows how the optimal steady state price $p^*_{ss}$ varies 
with changes in the input parameters $\beta_2$, $\gamma$ and $\alpha$. Furthermore, note that the optimal 
steady state price in the absence of a reference price is given by $p^*_{0} = 4.5$. 
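
The three prices quoted for the base scenario, and the monotonicity in the memory parameter stated in theorem 3.3, can be reproduced from the closed form (3.3.1). The sketch below assumes that the myopic price and the price without reference effects are obtained by setting $\gamma = 0$ and $\beta_2 = 0$ in that formula, as the theorem states; the function and variable names are hypothetical.

```python
def steady_state_price(beta0, beta1, beta2, alpha, gamma, c):
    """Loss-neutral steady state price, equation (3.3.1)."""
    num = (beta1 * c - beta0) * (1 - alpha * gamma) + beta2 * (1 - gamma) * c
    den = 2 * beta1 * (1 - alpha * gamma) + beta2 * (1 - gamma)
    return num / den


base = dict(beta0=100.0, beta1=-20.0, beta2=-40.0, alpha=0.5, gamma=0.5, c=4.0)

p_ss = steady_state_price(**base)                          # optimal steady state
p_myopic = steady_state_price(**{**base, "gamma": 0.0})    # myopic firm
p_no_ref = steady_state_price(**{**base, "beta2": 0.0})    # no reference effect

# Sensitivity with respect to the memory parameter alpha (illustrative values).
alphas = [0.0, 0.25, 0.5, 0.75, 0.9]
by_alpha = [steady_state_price(**{**base, "alpha": a}) for a in alphas]
```

The computed prices respect the ordering in (3.3.12), and the steady state price falls as $\alpha$ increases.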


We now extend the above results to the loss-averse case ($\beta_2 < \beta_3$) and give the reader an 
idea about what happens in the loss-seeking case ($\beta_2 > \beta_3$). 


3.3.2. Loss-averse and loss-seeking customer behavior 


This section investigates the transient and long term behavior of the optimal dynamic pricing 
policy under loss-averse customer behavior. 


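Figure 3.5.: Steady state prices under various parameters 


Theorem 3.4. All optimal price paths $p_t^*$ and reference price paths $r_t^*$ solving (3.2.4) for 
the loss-averse case ($\beta_2 < \beta_3$) converge monotonically to a steady state $p^*_{ss}(r_1)$, depending 
on the initial reference price $r_1$: 

$$p^*_{ss}(r_1) = \begin{cases} p^*_p = \dfrac{(\beta_1 c - \beta_0)(1 - \alpha\gamma) + \beta_2 (1 - \gamma) c}{2\beta_1 (1 - \alpha\gamma) + \beta_2 (1 - \gamma)} & \text{if } r_1 \leq p^*_p, \\ p^*_s = \dfrac{(\beta_1 c - \beta_0)(1 - \alpha\gamma) + \beta_3 (1 - \gamma) c}{2\beta_1 (1 - \alpha\gamma) + \beta_3 (1 - \gamma)} & \text{if } r_1 \geq p^*_s, \\ r_1 & \text{else,} \end{cases} \qquad (3.3.16)$$ 

where $p^*_p$ denotes a penetration type and $p^*_s$ denotes a skimming type steady state solution. 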


For a proof of theorem 3.4 we refer the reader to (Popescu and Wu 2007, theorem 4). 


Remark 3.6. From equation (3.3.16) it becomes clear that in the loss-averse case there exists 
more than one steady state, depending on the initial reference price $r_1$. If initial price 
expectations are lower than $p^*_p$, the optimal pricing strategy initially starts with a low price 
and monotonically increases prices until the steady state $p^*_p$ is reached. If initial price 
expectations are higher than $p^*_s$, the optimal pricing strategy initially starts with a high price 
and monotonically decreases prices until the steady state $p^*_s$ is reached. For initial price 
expectations lying between the two possible steady states $p^*_p$ and $p^*_s$, a constant pricing policy 
of the customer's initial price expectation $r_1$ is optimal. 


Figure 3.6 gives a numerical example of optimal price paths $p_t^*$ over time for different 
starting reference prices $r_1$, where $\beta_0 = 100$, $\beta_1 = -20$, $\beta_2 = -50$, $\beta_3 = -30$, 
$\alpha = 0.5$, $\gamma = 0.5$ and $c = 4$. An intuitive explanation of why in the loss-averse case there 
exists more than one steady state, depending on the initial reference price $r_1$, is provided in figure 
3.7. The figure shows that for initial reference prices $p^*_p \leq r_1 \leq p^*_s$ the decision variable 
$p^*(r_1)$ equals the state variable $r_1$ and thus the steady state is already reached. Letting 
$r_1 \geq p^*_s$ or $r_1 \leq p^*_p$, the optimal prices $p_t^*$ and reference prices $r_t^*$ converge 
monotonically over time until they reach the steady state $p^*_s$ or $p^*_p$, respectively. 
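
For the loss-averse example above, the range of steady states in (3.3.16) can be computed directly. The sketch assumes, as in the reconstruction of (3.3.16), that the skimming type bound uses $\beta_3$ where the penetration type bound uses $\beta_2$; the function names are hypothetical.

```python
def interior_price(beta0, beta1, beta_ref, alpha, gamma, c):
    """Closed form of (3.3.1) with a given reference coefficient beta_ref."""
    num = (beta1 * c - beta0) * (1 - alpha * gamma) + beta_ref * (1 - gamma) * c
    den = 2 * beta1 * (1 - alpha * gamma) + beta_ref * (1 - gamma)
    return num / den


def loss_averse_steady_state(r1, beta0, beta1, beta2, beta3, alpha, gamma, c):
    """Steady state price as a function of the initial reference price, (3.3.16)."""
    p_pen = interior_price(beta0, beta1, beta2, alpha, gamma, c)   # penetration type
    p_skim = interior_price(beta0, beta1, beta3, alpha, gamma, c)  # skimming type
    if r1 <= p_pen:
        return p_pen
    if r1 >= p_skim:
        return p_skim
    return r1  # any reference price in between is itself a steady state


example = dict(beta0=100.0, beta1=-20.0, beta2=-50.0, beta3=-30.0,
               alpha=0.5, gamma=0.5, c=4.0)
low = loss_averse_steady_state(4.2, **example)
mid = loss_averse_steady_state(4.3, **example)
high = loss_averse_steady_state(4.4, **example)
```

With these parameters the two bounds are roughly 4.27 and 4.33, so an initial reference price of 4.3 is already a steady state.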


If buyers are loss-seeking ($\beta_2 > \beta_3$) and thus respond more to discounts than to 
surcharges, then problem (3.2.4) admits no steady state and the optimal pricing policy cycles 
(see figure 3.8). This type of asymmetry is inconsistent with prospect theory, but has found 
some empirical validation in the marketing literature (e.g. Greenleaf (1995) and Slonim and 
Garbarino (2002)). 


Figure 3.6.: Optimal price path (loss-averse) 

Figure 3.7.: Optimal steady state price (loss-averse) 

Figure 3.8.: Optimal price path (loss-seeking) 
4. Integrated Model with Reference Price Effects 


4.1. Introduction 


In the previous chapters we gave the reader a review of the state-of-the-art methodological 
literature and found that it divides into two rather distinct streams: The operations oriented 
stream described in chapter 2 and the marketing oriented stream described in chapter 3. 


Operations management oriented work, introduced in chapter 2, mainly deals with 
determining optimal production decisions and thus usually describes a firm’s possible cost 
structure very well: costs are assumed to be non-stationary, meaning that they can vary over 
time, and fixed costs can be included in the model. The limitation of this model is that it 
relies on rather simplistic demand assumptions. Demand is modeled as a function 
of the current price only, not taking into account past prices, which also influence 
customers’ buying decisions. 


The strength of the marketing models, introduced in chapter 3, is their rich demand model, 
which accounts for intertemporal demand correlations by incorporating both the current price 
and the firm’s pricing history in the model. Yet, they have serious limitations as they use a very 
simplistic cost structure which does not account for supply chain management interactions 
(e.g. stationary variable costs). What is even more restrictive is that marketing takes 
demand fulfillment for granted. Thus, although the demand function is defined stochastically 
(see definition 3.1), the problem, by maximizing expected profits, reduces to a 
deterministic setting. In conclusion, both prevalent research streams consider only a partial 
picture of the relevant system. 


This work is devoted to combining the two above described literature streams: we want 
to use the rich cost models commonly used in operations research and combine them with 
demand models, which account for intertemporal demand correlation and have been mainly 
applied by marketing so far. 


4.2. Model formulation 


When using mathematics to solve real world problems, our main aim is to obtain a mathematical 
model that describes or represents the real situation as well as possible. The formulation 
of a mathematical model is a challenging task: first the scientist needs to understand the 
problem and then has to identify generalizable principles and processes such that a complex 
system can be simplified/reduced to a tractable level that makes the essential structure of the 
system clear. Variables have to be defined and relationships between these variables have 
to be established. The key to a good model lies in which simplifications are introduced and how; 
it is very important to understand what aspects of the system the model is intended 
to describe, and what the model’s limitations are as a result of the simplification. Hence, 
we now turn to the formulation of an integrated inventory control and pricing model, 
combining the strengths and benefiting from the dynamics of each of the two models introduced 
in chapters 2 and 3. This new model should be better able to represent the real 
world situation. 


We consider a monopolistic retailer or manufacturer who maintains an inventory of a 
particular product and, prior to facing random demand in each period t of a finite horizon 
T, jointly determines a selling price and a stocking quantity. The integrated model with 
reference price effects is a combination of the two models introduced in chapter 2 and chapter 
3. Thus, we only briefly describe the variables used and refer the reader to the two previous 
chapters for further detail. Demand perturbations $\epsilon_t$ in different periods are assumed to be 
statistically independent and identically distributed according to general stochastic demand 
functions. We furthermore assume that consumers have a memory and demand not only 
depends on the current selling price $p_t$, but also on a reference level $r_t$ resulting from 
the pricing history (see chapter 3). The inventory level $x_t$ is reviewed at regular intervals 
(periodic review model), and after each review, at the beginning of a new period t, an appropriate 
quantity $y_t - x_t$ is ordered and a per unit selling price $p_t$ is charged. As in the previous chapters, 
we assume that all input variables are stationary and thus do not change over time. The 
ordering costs include a per unit variable cost $c \geq 0$ and a fixed setup cost $k \geq 0$ which 
is incurred only if an order is placed ($y_t > x_t$). Again, as in chapter 2, orders placed are 
essentially received immediately (received in time to meet demand that arises in that period). 
Costs are expressed in beginning-of-period cash units; cash flows occurring in subsequent 
time periods are discounted by a one-period discount factor $\gamma \in (0, 1)$. Each unit of positive 
leftover stock at the end of each period incurs holding costs $h \geq 0$. If demand exceeds the 
inventory on hand, per unit backlogging (penalty) costs b are charged and demand is filled 
when the additional inventory becomes available. To ensure that it is not optimal to never 
order anything and merely accumulate backlog penalty costs, we assume that $b > (1 - \gamma)c$. 
After the last period, a final order is placed to fulfill backlogged demand. Furthermore, 
for simplicity we assume that leftover units at the end of the horizon can be 
salvaged at the original ordering costs ($v = c$ in chapter 2). For the convenience of the 
reader the notation is summarized in table 4.1. 


In the following, we model demand additively as in chapter 3 such that it is decreasing in 
price $p_t$. Furthermore, consumers perceive a gain, and thus demand increases, if the current 
price is lower than the reference price ($p_t < r_t$), and consumers perceive a loss, and demand 
decreases, if the current price is higher than the reference price ($p_t > r_t$). 

Variable | Characteristics | Description 
$T$ | | length of the planning horizon 
$D_t(p_t, r_t, \epsilon_t)$ | | unit(s) demanded in period t (random variable) 
$F(\cdot)$ | | cdf of demand perturbation $\epsilon_t$ 
$f(\cdot)$ | | pdf of demand perturbation $\epsilon_t$ 
$p_t$ | $\bar{p} \geq p_t \geq \underline{p}$ | per unit sales price in period t 
$r_t$ | $\bar{r} \geq r_t \geq \underline{r}$ | per unit reference price in period t 
$y_t$ | | inventory level after ordering in period t 
$x_t$ | | inventory level before ordering 
$c$ | $c \geq 0$ | per unit production/ordering cost 
$k$ | $k \geq 0$ | fixed ordering (setup) cost 
$\gamma$ | $\gamma \in (0, 1)$ | discount factor 
$\alpha$ | $0 \leq \alpha < 1$ | memory parameter 
$b$ | $b > (1 - \gamma)c$ | per unit backlog penalty costs 
$h$ | $h \geq 0$ | per unit holding costs 
$v_t(x_t, r_t)$ | | expected optimal profit-to-go function 


Table 4.1.: Notation: Integrated model 


Definition 4.1. The stochastic demand is modeled by the piecewise linear function 

$$D_t(p_t, r_t, \epsilon_t) = \beta_0 + \beta_1 p_t + \beta_2 \max\{p_t - r_t, 0\} + \beta_3 \min\{p_t - r_t, 0\} + \epsilon_t, \qquad (4.2.1)$$ 

where $r_t$ denotes the reference price in period t, $\epsilon_t$ is i.i.d. according to an arbitrary 
probability function with mean $E[\epsilon_t] = 0$, where $f(\cdot)$ denotes its density function and 
$F(\cdot)$ its distribution function. Furthermore $\beta_0 > 0$, $\beta_1 < 0$ and $\beta_2, \beta_3 \leq 0$ 
ensure that demand is decreasing in price and non-decreasing in reference price. Moreover we assume 
that the expected demand $E[D(p_t, r_t, \epsilon_t)]$ is non-negative. 


Reference price r, is formed by exponential smoothing as in chapter 3. For the conve- 
nience of the reader we again give the definition here: 


Definition 4.2. Let $p_t$ denote the observed selling price and $r_t$ the reference price for a 
specific brand in period t, then for the memory parameter $0 \leq \alpha < 1$ a reference price 
updating mechanism is given by $r_{t+1} = \alpha r_t + (1 - \alpha) p_t$. 


As in the previous chapters the objective is to maximize expected profit over the entire 
planning horizon T. The maximum total expected discounted profit $v(x_1, r_1)$, when the 
initial inventory level at the beginning of the planning horizon is given by $x_1$ and the initial 
reference price is given by $r_1$, is given as follows: 

$$v(x_1, r_1) = \sum_{t=1}^{T} \gamma^{t-1} \max_{y_t \geq x_t,\, p_t} \left( p_t E[D(p_t, r_t, \epsilon_t)] - c(y_t - x_t) - k\delta(y_t - x_t) - G(y_t, p_t, r_t) \right), \qquad (4.2.2)$$ 

whereby cash flows occurring in subsequent time periods are again discounted by a one-period 
discount factor $\gamma \in (0, 1)$ and $\delta(u) = 1$ if $u > 0$ and $\delta(u) = 0$ otherwise. 
Furthermore $G(y_t, p_t, r_t)$ denotes the expected holding and backlogging cost: 

$$G(y_t, p_t, r_t) = h \int_{-\infty}^{y_t - E[D(p_t, r_t, \epsilon_t)]} \left( y_t - E[D(p_t, r_t, \epsilon_t)] - u \right) f(u)\,du + b \int_{y_t - E[D(p_t, r_t, \epsilon_t)]}^{\infty} \left( E[D(p_t, r_t, \epsilon_t)] + u - y_t \right) f(u)\,du. \qquad (4.2.3)$$ 


Similar to section 2.3, the decision, which price to charge in each period and how much 
to order, is made in stages. Again the desire for high present profits, obtained by charging 
relatively low prices, must be balanced against the undesirability of low future profits. As 
described in section 2.3.1 this tradeoff is very well captured by the technique of dynamic 
programming. 
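
To make the expected holding and backlogging cost (4.2.3) concrete, the following sketch evaluates it for a perturbation with finitely many outcomes. Replacing the density $f(\cdot)$ by a discrete distribution is an assumption made here for illustration only, and the function name and numerical values are hypothetical.

```python
def expected_holding_backlog_cost(y, mean_demand, eps_values, eps_probs, h, b):
    """Discrete analogue of G in (4.2.3): holding cost h on leftover stock,
    backlog cost b on unmet demand, averaged over the perturbation."""
    cost = 0.0
    for u, prob in zip(eps_values, eps_probs):
        net = y - mean_demand - u  # stock left after demand mean_demand + u
        cost += prob * (h * max(net, 0.0) + b * max(-net, 0.0))
    return cost


# Illustrative perturbation taking the values -1, 0, 1 with equal probability.
eps_values = [-1.0, 0.0, 1.0]
eps_probs = [1 / 3, 1 / 3, 1 / 3]

g_no_safety = expected_holding_backlog_cost(10.0, 10.0, eps_values, eps_probs,
                                            h=1.0, b=2.0)
g_safety = expected_holding_backlog_cost(12.0, 10.0, eps_values, eps_probs,
                                         h=1.0, b=2.0)
```

Stocking exactly the expected demand leaves both a holding and a backlogging risk, whereas a stock two units above expected demand removes the backlogging risk in this example at the price of higher holding costs.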


4.3. Dynamic program 


We now reformulate problem (4.2.2) in terms of dynamic programming using backward 
recursion as in section 2.3.1. The corresponding Bellman equation can be written as: 

$$v_t(x_t, r_t) = \max_{y_t \geq x_t,\, p_t \in [\underline{p}, \bar{p}]} \left\{ J_t(x_t, y_t, p_t, r_t) \right\}, \qquad (4.3.1)$$ 

$$J_t(x_t, y_t, p_t, r_t) = p_t E[D(p_t, r_t, \epsilon_t)] - c(y_t - x_t) - k\delta(y_t - x_t) - G(y_t, p_t, r_t) + \gamma E\left[ v_{t+1}(y_t - D(p_t, r_t, \epsilon_t), \alpha r_t + (1 - \alpha) p_t) \right], \qquad (4.3.2)$$ 

with the terminal value 

$$v_{T+1}(x_{T+1}, r_{T+1}) = c x_{T+1}, \qquad (4.3.3)$$ 

where $G(y_t, p_t, r_t)$ is defined as in equation (4.2.3). 


The system (4.3.1) is described in the states $x_t$, the inventory level before ordering, and 
$r_t$, the reference price level, and the actions (decisions) $y_t$, the inventory level after ordering, 
and $p_t$, the per unit selling price. Note that the admissible actions are restricted to 
$y_t \in [x_t, \infty)$, since only non-negative orders are permitted, and $p_t \in [\underline{p}, \bar{p}]$, 
such that expected demand is non-negative and a monopolistic framework is created. 

Figure 4.1.: System dynamics (diagram of two consecutive stages t and t+1: in each stage the states 
$x_t$, $r_t$ enter, the decisions $y_t$, $p_t$ are the maximizers of $J_t(x_t, y_t, p_t, r_t)$ over 
$y_t \geq x_t$ and $p_t$, the return is the maximized value of $J_t$, and the states are linked by the 
transitions $x_{t+1} = y_t - D_t(p_t, r_t, \epsilon_t)$ and $r_{t+1} = \alpha r_t + (1 - \alpha) p_t$) 


For the convenience of the reader, the stages, states and decisions are summarized again in table 4.2.

States      $x_t$                    | inventory before ordering in stage $t$
            $r_t$                    | reference price in stage $t$

Decisions   $y_t = y_t(x_t, r_t)$    | inventory after ordering in stage $t$
            $p_t = p_t(x_t, r_t)$    | per unit selling price in stage $t$

Table 4.2.: Characteristics of the dynamic program


State and decision variables are related via the transition functions

$$x_{t+1} = y_t - D_t(p_t, r_t, \epsilon_t), \tag{4.3.4}$$

$$r_{t+1} = \alpha r_t + (1-\alpha) p_t. \tag{4.3.5}$$

Note that equation (4.3.4) gives the gross quantity of stock at the beginning of period $t+1$, which equals the inventory on hand after ordering at the beginning of period $t$ less the total quantity actually sold during that period (we refer the reader to figure 2.2 for a graphical illustration). Equation (4.3.5) gives the consumers' reference price in period $t+1$, which is formed from past prices by exponential smoothing with a memory parameter $\alpha$ (see figure 3.2). A brief idea of the system dynamics is given in figure 4.1.
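The two transition functions can be traced with a short simulation. The following is only a minimal sketch: the linear loss-neutral demand function, all parameter values, the normal demand shock and the fixed policy are illustrative assumptions and are not taken from the text.

```python
import random


def demand(p, r, eps, beta0=100.0, beta1=-2.0, beta2=-1.0):
    """Illustrative loss-neutral linear demand: beta0 + beta1*p + beta2*(p - r) + eps."""
    return beta0 + beta1 * p + beta2 * (p - r) + eps


def step(y, p, r, eps, alpha=0.7):
    """Apply the transitions (4.3.4) and (4.3.5) for one period."""
    x_next = y - demand(p, r, eps)          # a negative value is a backlog
    r_next = alpha * r + (1.0 - alpha) * p  # exponential smoothing of past prices
    return x_next, r_next


def simulate(policy, x1, r1, periods, sigma=5.0, seed=0):
    """Roll the states forward under policy(x, r) -> (y, p)."""
    rng = random.Random(seed)
    x, r = x1, r1
    path = [(x, r)]
    for _ in range(periods):
        y, p = policy(x, r)
        if y < x:
            raise ValueError("only non-negative orders are permitted (y >= x)")
        x, r = step(y, p, r, rng.gauss(0.0, sigma))
        path.append((x, r))
    return path


if __name__ == "__main__":
    # Hypothetical policy: order up to 60 units and always charge a price of 18.
    fixed = lambda x, r: (max(x, 60.0), 18.0)
    for t, (x, r) in enumerate(simulate(fixed, x1=0.0, r1=25.0, periods=5), start=1):
        print(f"t={t}: x={x:7.2f}, r={r:6.2f}")
```

Under a constant price the reference price converges geometrically to that price at rate $\alpha$, which is the memory effect described above.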


In the following we focus on the special case of zero fixed ordering costs ($k = 0$), as in sections 2.3.2 and 2.3.3. We then transform the above dynamic program by subtracting the variable ordering costs $c x_t$ from the value function $v_t(x_t, r_t)$. Hence, by letting $V_t(x_t, r_t) = v_t(x_t, r_t) - c x_t$, equation (4.3.2) becomes

$$\begin{aligned}
J_t(y_t, p_t, r_t) &= p_t E[D(p_t, r_t, \epsilon_t)] - c y_t - G(y_t, p_t, r_t) \\
&\quad + \gamma E\big[V_{t+1}\big(y_t - D(p_t, r_t, \epsilon_t),\, \alpha r_t + (1-\alpha) p_t\big) + c\big(y_t - D(p_t, r_t, \epsilon_t)\big)\big] \\
&= p_t E[D(p_t, r_t, \epsilon_t)] - c y_t - G(y_t, p_t, r_t) \\
&\quad + \gamma E\big[V_{t+1}\big(y_t - D(p_t, r_t, \epsilon_t),\, \alpha r_t + (1-\alpha) p_t\big)\big] + \gamma c\big(y_t - E[D(p_t, r_t, \epsilon_t)]\big),
\end{aligned} \tag{4.3.6}$$

and hence by omitting redundant subscripts equations (4.3.1) to (4.3.3) can be written as

$$V_t(x, r) = \max_{y \ge x,\, p} \{ J_t(y, p, r) \}, \tag{4.3.7}$$

$$J_t(y, p, r) = (p - \gamma c) E[D(p, r, \epsilon)] - c(1-\gamma) y - G(y, p, r) + \gamma E\big[V_{t+1}\big(y - D(p, r, \epsilon),\, \alpha r + (1-\alpha) p\big)\big], \tag{4.3.8}$$

$$V_{T+1}(x, r) = 0, \tag{4.3.9}$$


where again G(y, p,r) is defined as in equation (4.2.3). 
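The backward recursion (4.3.7) to (4.3.9) can be sketched on a coarse grid. The sketch below is not the numerical method used in this work: the grids, the three-point demand shock, the parameter values, the linear loss-neutral demand and the nearest-grid-point rounding of next-period states are all simplifying assumptions chosen for illustration.

```python
from math import inf


def solve_transformed_dp(T=3, c=2.0, h=1.0, b=6.0, gamma=0.9, alpha=0.5,
                         betas=(40.0, -2.0, -1.0)):
    """Backward recursion for (4.3.7)-(4.3.9) on coarse, hypothetical grids."""
    beta0, beta1, beta2 = betas
    xs = [float(v) for v in range(-10, 41, 5)]    # inventory grid, shared by x and y
    rs = [6.0, 8.0, 10.0, 12.0, 14.0]             # reference price grid
    ps = [6.0 + 0.5 * i for i in range(17)]       # admissible prices 6.0 .. 14.0
    shocks = [(-5.0, 0.25), (0.0, 0.5), (5.0, 0.25)]  # (eps, probability), mean zero

    def nearest(grid, value):
        # Index of the closest grid point; values outside the grid are clipped.
        return min(range(len(grid)), key=lambda i: abs(grid[i] - value))

    V = [[0.0] * len(rs) for _ in xs]             # terminal condition V_{T+1} = 0
    policies = []
    for _ in range(T):                            # periods T, T-1, ..., 1
        V_new = [[0.0] * len(rs) for _ in xs]
        policy = [[None] * len(rs) for _ in xs]
        for j, r in enumerate(rs):
            best_for_y = []                       # max over p of J_t(y, p, r), per y
            for y in xs:
                best_val, best_p = -inf, None
                for p in ps:
                    mean_d = beta0 + beta1 * p + beta2 * (p - r)
                    j_next = nearest(rs, alpha * r + (1.0 - alpha) * p)
                    val = (p - gamma * c) * mean_d - c * (1.0 - gamma) * y
                    for eps, prob in shocks:
                        d = mean_d + eps
                        # expected holding and backlogging cost G(y, p, r)
                        val -= prob * (h * max(y - d, 0.0) + b * max(d - y, 0.0))
                        # discounted value of the next state (rounded to the grid)
                        val += gamma * prob * V[nearest(xs, y - d)][j_next]
                    if val > best_val:
                        best_val, best_p = val, p
                best_for_y.append((best_val, y, best_p))
            running = (-inf, None, None)
            for i in range(len(xs) - 1, -1, -1):  # suffix maximum enforces y >= x
                if best_for_y[i][0] > running[0]:
                    running = best_for_y[i]
                V_new[i][j] = running[0]
                policy[i][j] = (running[1], running[2])
        V = V_new
        policies.insert(0, policy)
    return xs, rs, V, policies
```

Because $J_t(y, p, r)$ does not depend on $x$, the inner maximization is carried out once per reference price and the constraint $y \ge x$ is imposed afterwards by a suffix maximum over the inventory grid; `policies[0][i][j]` holds the pair $(y, p)$ chosen in the first period for state `(xs[i], rs[j])`.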


Remark 4.1. Solving the transformed model (4.3.7) to (4.3.9) is computationally more efficient, since $J_t(y, p, r)$ now depends on only three variables instead of the four in equation (4.3.1). Furthermore, as $J_t(y, p, r)$ no longer depends on the inventory level before ordering $x$, joint concavity in $x$ follows trivially for the analytical proof of a base-stock policy and does not have to be stated explicitly as we did in theorems 2.5 and 2.7.


5. Analytical Analysis of the
Integrated Model


The system of the integrated model consists of two state and two decision variables (see table 4.2) and is thus much more complex than the one discussed in chapter 2. This complicates the analysis significantly, and we can only prove structural results under restrictive assumptions. We start by analyzing the single-period problem (newsvendor model) and then extend some of the obtained results to the two-period case. However, even in this simplest version of the model, we only consider loss-neutral customer behavior ($\beta_2 = \beta_3$) to ensure analytical tractability. Thus the demand model (4.2.1) reduces to

$$D(p_t, r_t, \epsilon_t) = \beta_0 + \beta_1 p_t + \beta_2 (p_t - r_t) + \epsilon_t. \tag{5.0.1}$$

Furthermore, for section 5.2 we assume that the random variable $\epsilon_t$ follows an arbitrary probability density function $f(\cdot)$ that is continuous and differentiable, in order to avoid additional complexity in the analysis of the two-period model. At the end of this chapter, for the multi-period case we extend the proof of Federgruen and Heching (1999) and show the optimality of a base-stock policy in the integrated model. Admittedly, these results are of limited practical relevance, since they require several assumptions on demand and revenues that many commonly used demand functions, including the linear one defined in equation (5.0.1), do not fulfill. However, we provide an extensive numerical study in chapter 6, where we show that in the cases analyzed the obtained results still hold under much less restrictive assumptions.


5.1. One-period model 


We start our analysis of equation (4.3.1) with the last period. The optimality of a base-stock-list-price policy follows directly from Federgruen and Heching (1999), since the reference price $r$ is only an additional parameter in the one-period case. In this section, however, we provide an alternative proof and give implicit solutions for the optimal price and inventory level with respect to the reference price, which will be used to extend the base-stock-list-price result to a two-period setting in section 5.2. To simplify the notation, in the following we write, where not stated differently, $D$ for $D(p, r, \epsilon)$ and $E[D]$ for $E[D(p, r, \epsilon)]$. Furthermore, $E[D_y]$, $E[D_p]$, $E[D_r]$ denote the derivatives of expected demand $E[D]$ with respect to $y$, $p$, $r$. According to chapter 4, the expected one-period profit is given by

$$E[\Pi(x, y, p, r, \epsilon)] = p E[D] - c(y - x) - G(y, p, r), \tag{5.1.1}$$

with expected holding and backlogging costs

$$G(y, p, r) = h \int_{-\infty}^{y - E[D]} (y - E[D] - u) f(u)\,du + b \int_{y - E[D]}^{\infty} (E[D] + u - y) f(u)\,du. \tag{5.1.2}$$


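Lemma 5.1. The expected profit $E[\Pi(x, y, p, r, \epsilon)]$ is jointly concave and submodular (see definition 2.3) in $y$ and $p$. Furthermore, $E[\Pi(x, y, p, r, \epsilon)]$ is strictly concave in $p$.

Proof. By applying Leibniz's integration rule we obtain the following partial derivatives:

$$\frac{\partial E[\Pi(x, y, p, r, \epsilon)]}{\partial y} = (b - c) - (h + b) F(y - E[D]), \tag{5.1.3}$$

$$\frac{\partial^2 E[\Pi(x, y, p, r, \epsilon)]}{\partial y^2} = -(h + b) f(y - E[D]), \tag{5.1.4}$$

$$\frac{\partial E[\Pi(x, y, p, r, \epsilon)]}{\partial p} = E[D] + (p - b) E[D_p] + (h + b) E[D_p] F(y - E[D]), \tag{5.1.5}$$

$$\frac{\partial^2 E[\Pi(x, y, p, r, \epsilon)]}{\partial p^2} = 2 E[D_p] + (p - b) E[D_{pp}] - (h + b) E[D_p]^2 f(y - E[D]) + (h + b) E[D_{pp}] F(y - E[D]), \tag{5.1.6}$$

$$\frac{\partial^2 E[\Pi(x, y, p, r, \epsilon)]}{\partial p\, \partial y} = (h + b) E[D_p] f(y - E[D]). \tag{5.1.7}$$

From equation (4.2.1), it is easy to see that $\partial^2 E[\Pi(x, y, p, r, \epsilon)] / \partial y^2 \le 0$ and $\partial^2 E[\Pi(x, y, p, r, \epsilon)] / \partial p^2 < 0$, and hence the expected profit $E[\Pi(x, y, p, r, \epsilon)]$ is concave in $y$ and strictly concave in $p$. Furthermore, $E[\Pi(x, y, p, r, \epsilon)]$ is submodular in $y$ and $p$ by definition 2.3, since $\partial^2 E[\Pi(x, y, p, r, \epsilon)] / (\partial p\, \partial y) \le 0$. Moreover, the determinant of the Hesse matrix is

$$\begin{aligned}
&\frac{\partial^2 E[\Pi(x, y, p, r, \epsilon)]}{\partial y^2} \cdot \frac{\partial^2 E[\Pi(x, y, p, r, \epsilon)]}{\partial p^2} - \left( \frac{\partial^2 E[\Pi(x, y, p, r, \epsilon)]}{\partial p\, \partial y} \right)^2 \\
&\quad = -(h + b) f(y - E[D]) \cdot \big( 2 E[D_p] + (p - b) E[D_{pp}] + (h + b) E[D_{pp}] F(y - E[D]) \big) \ge 0.
\end{aligned} \tag{5.1.8}$$

Since the diagonal entries of the Hesse matrix are non-positive and its determinant is non-negative, the Hesse matrix is negative semi-definite, which ensures that $E[\Pi(x, y, p, r, \epsilon)]$ is jointly concave in $p$ and $y$. □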


For later investigations we also need some structural properties of the expected holding and backlogging cost function $G(y, p, r)$, which we provide in the following.

Lemma 5.2. The expected holding and backlogging costs $G(y, p, r)$ defined in equation (5.1.2) are convex in $y$ but not necessarily in $p$. Furthermore, $G(y, p, r)$ is supermodular in $y$ and $p$.

Proof. The partial derivatives of $G(y, p, r)$ with respect to $y$ and $p$ are given by:

$$\frac{\partial G(y, p, r)}{\partial y} = -b + (h + b) F(y - E[D]), \tag{5.1.9}$$

$$\frac{\partial^2 G(y, p, r)}{\partial y^2} = (h + b) f(y - E[D]) \ge 0, \tag{5.1.10}$$

$$\frac{\partial G(y, p, r)}{\partial p} = b E[D_p] - (h + b) E[D_p] F(y - E[D]), \tag{5.1.11}$$

$$\frac{\partial^2 G(y, p, r)}{\partial p^2} = b E[D_{pp}] + (h + b) E[D_p]^2 f(y - E[D]) - (h + b) E[D_{pp}] F(y - E[D]). \tag{5.1.12}$$

Equation (4.2.1) ensures that $G(y, p, r)$ is convex in $y$. Note that $G(y, p, r)$ is not necessarily convex in $p$. Furthermore,

$$\frac{\partial^2 G(y, p, r)}{\partial p\, \partial y} = -(h + b) E[D_p] f(y - E[D]) \ge 0, \tag{5.1.13}$$

which ensures the supermodularity in $p$ and $y$. □


The above two lemmas lead to an optimal pricing and ordering policy (compare section 
2.2). 


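Theorem 5.1 (Base-stock-list-price policy). For the linear demand function defined in equation (5.0.1) and the system (4.3.1) to (4.3.3) and $b > (1 - \gamma)c$, the optimal policy for the one-period case is a base-stock-list-price policy, where $y^*(x, r)$ and $p^*(x, r)$ are given by

$$y^*(x, r) = \begin{cases} S^*(P^*(r), r), & x \le S^*(P^*(r), r) \\ x, & x > S^*(P^*(r), r), \end{cases} \tag{5.1.14}$$

$$p^*(x, r) = \begin{cases} P^*(r), & x \le S^*(P^*(r), r) \\ \text{the discounted price solving (5.1.18)}, & x > S^*(P^*(r), r), \end{cases} \tag{5.1.15}$$

where the base-stock level $S^*(P^*(r), r)$ is given by

$$S^*(P^*(r), r) = F^{-1}\left( \frac{b - (1 - \gamma)c}{h + b} \right) + E[D(P^*(r), r, \epsilon)], \tag{5.1.16}$$

and the list-price $P^*(r)$ is the unique solution to

$$E[D(P^*(r), r, \epsilon)] + (P^*(r) - c) E[D_p(P^*(r), r, \epsilon)] = 0. \tag{5.1.17}$$

Furthermore, the discounted price $p^*(x, r)$ is given implicitly by

$$E[D(p^*(x, r), r, \epsilon)] + (p^*(x, r) - \gamma c - b) E[D_p(p^*(x, r), r, \epsilon)] + (h + b) E[D_p(p^*(x, r), r, \epsilon)] F\big(x - E[D(p^*(x, r), r, \epsilon)]\big) = 0. \tag{5.1.18}$$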


Proof. The expected total profit $J(x, y, p, r)$ from equation (4.3.2) can be expressed in terms of the expected one-period profit $E[\Pi(x, y, p, r, \epsilon)]$ such that

$$J(x, y, p, r) = E[\Pi(x, y, p, r, \epsilon)] + \gamma c\big(y - E[D(p, r, \epsilon)]\big). \tag{5.1.19}$$

Since $y - E[D(p, r, \epsilon)]$ is trivially jointly concave in $y$ and $p$, it follows directly from lemma 5.1 that the expected total profit $J(x, y, p, r)$ is jointly concave in $y$ and $p$. Thus the optimization problem over the two variables $y$ and $p$ can be reduced to an optimization problem over the single variable $y$ as a function of $p$, with subsequent substitution of the result back into $J(x, y, p, r)$, which is thereafter solved for $p$. In the following we show the optimality of a base-stock policy and provide an optimal order-up-to level $S^*(p, r)$ as a function of price $p$ and then continue with the price optimization. By setting

$$\frac{\partial J(x, y, p, r)}{\partial y} = b - (1 - \gamma)c - (h + b) F(y - E[D]) \tag{5.1.20}$$

equal to zero we obtain the solution to $\max_y J(x, y, p, r)$, which is denoted by

$$y(x, r, p) = F^{-1}\left( \frac{b - (1 - \gamma)c}{h + b} \right) + E[D(p, r, \epsilon)]. \tag{5.1.21}$$

Since $y(x, r, p)$ is not necessarily greater than or equal to $x$, but the model only allows for non-negative orders ($y \ge x$), (5.1.21) gives the optimal solution to $\max_{y \ge x} J(x, y, p, r)$ only in the case $y(x, r, p) \ge x$. In the case of $y(x, r, p) < x$ the optimal policy is not to order ($\arg\max_{y \ge x} J(x, y, p, r) = x$), since $J(x, y, p, r)$ is concave in $y$. Thus a base-stock policy (compare definition 2.1) with order-up-to level

$$S^*(p, r) = F^{-1}\left( \frac{b - (1 - \gamma)c}{h + b} \right) + E[D(p, r, \epsilon)] \tag{5.1.22}$$

is optimal.


To prove equations (5.1.17) and (5.1.18) we now need to distinguish between the two cases $x \le S^*(p, r)$ and $x > S^*(p, r)$.

Let $x \le S^*(p, r)$. For notational convenience we denote $y^0 := F^{-1}\left( \frac{b - (1 - \gamma)c}{h + b} \right)$, which yields $S^*(p, r) = y^0 + E[D(p, r, \epsilon)]$. In order to find the optimal list-price $p$ we substitute $y = S^*(p, r)$ into $J(x, y, p, r)$, which by using equation (5.1.1) and equation (5.1.19) gives

$$J(x, S^*(p, r), p, r) = p E[D(p, r, \epsilon)] - c\big((1 - \gamma) y^0 + E[D(p, r, \epsilon)] - x\big) - h \int_{-\infty}^{y^0} (y^0 - u) f(u)\,du - b \int_{y^0}^{\infty} (u - y^0) f(u)\,du. \tag{5.1.23}$$

By differentiating (5.1.23) with respect to $p$ we obtain

$$\frac{\partial J(x, S^*(p, r), p, r)}{\partial p} = E[D(p, r, \epsilon)] + (p - c) E[D_p(p, r, \epsilon)]. \tag{5.1.24}$$

Since $J(x, y, p, r)$ is jointly concave in $p$ and $y$, equation (5.1.17) follows directly by setting equation (5.1.24) equal to zero.


Let $x > S^*(p, r)$, which yields $y^*(x, r, p) = x$. Then substituting $y = x$ gives

$$J(x, x, p, r) = (p - \gamma c) E[D(p, r, \epsilon)] + \gamma c x - h \int_{-\infty}^{x - E[D(p, r, \epsilon)]} \big(x - E[D(p, r, \epsilon)] - u\big) f(u)\,du - b \int_{x - E[D(p, r, \epsilon)]}^{\infty} \big(E[D(p, r, \epsilon)] + u - x\big) f(u)\,du. \tag{5.1.25}$$

Differentiation with respect to $p$ results in

$$\frac{\partial J(x, x, p, r)}{\partial p} = E[D(p, r, \epsilon)] + (p - \gamma c - b) E[D_p(p, r, \epsilon)] + (h + b) E[D_p(p, r, \epsilon)] F\big(x - E[D(p, r, \epsilon)]\big). \tag{5.1.26}$$

Since $J(x, y, p, r)$ is jointly concave in $p$ and $y$, equation (5.1.18) results from setting equation (5.1.26) equal to zero.

In order to prove the optimality of a list-price policy, we need to show that $p^*(x, r)$ is unique and non-increasing in $x$. By equation (5.1.19) and demand $D(p, r, \epsilon)$ being concave in $p$, it follows by lemma 5.1 that $J(x, y, p, r)$ is strictly concave in $p$ and thus the optimal price $p^*(x, r)$ is unique. Furthermore, $J(x, y, p, r)$ is submodular in $y$ and $p$. It follows from theorem 2.8.1 in Topkis (1998) that the optimal price $p^*(y, r)$ is non-increasing in $y$ and hence in $x$. Substituting the optimal list price $p = P^*(r)$ in equation (5.1.22) verifies equation (5.1.16). □
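For the linear demand function (5.0.1), condition (5.1.17) can be solved in closed form, and (5.1.16) then gives the base-stock level directly. The sketch below assumes a zero-mean normal demand shock with standard deviation `sigma`; this distribution, the function names and all parameter values in the usage lines are illustrative assumptions, not part of the theorem.

```python
from statistics import NormalDist


def list_price(r, c, beta0, beta1, beta2):
    """Solve E[D] + (P - c) * E[D_p] = 0 for linear demand with E[eps] = 0."""
    slope = beta1 + beta2                  # E[D_p] for the demand function (5.0.1)
    if slope >= 0:
        raise ValueError("expected demand must be decreasing in price")
    return 0.5 * c + (beta2 * r - beta0) / (2.0 * slope)


def base_stock_level(r, c, h, b, gamma, beta0, beta1, beta2, sigma):
    """Return (S*, P*) from equations (5.1.16) and (5.1.17)."""
    if b <= (1.0 - gamma) * c:
        raise ValueError("theorem 5.1 requires b > (1 - gamma) * c")
    price = list_price(r, c, beta0, beta1, beta2)
    mean_demand = beta0 + beta1 * price + beta2 * (price - r)
    fractile = (b - (1.0 - gamma) * c) / (h + b)
    safety_stock = NormalDist(0.0, sigma).inv_cdf(fractile)  # F^{-1} of the fractile
    return safety_stock + mean_demand, price


if __name__ == "__main__":
    S, P = base_stock_level(r=20.0, c=5.0, h=1.0, b=4.0, gamma=0.9,
                            beta0=100.0, beta1=-2.0, beta2=-1.0, sigma=10.0)
    print(f"list price P* = {P:.2f}, base-stock level S* = {S:.2f}")
```

The closed form follows from $E[D] = \beta_0 + (\beta_1 + \beta_2) P - \beta_2 r$ and $E[D_p] = \beta_1 + \beta_2$, so that (5.1.17) is linear in $P$.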


Remark 5.1. By the assumption $b > (1 - \gamma)c$ in theorem 5.1 we exclude the trivial solution of not placing any orders and accumulating backlogging costs until the end of the planning horizon $T$ (compare section 2.2.1). Thus $F^{-1}\left( \frac{b - (1 - \gamma)c}{h + b} \right)$ is well defined, since its argument lies in $(0, 1]$.

Remark 5.2. Note that lemmas 5.1 and 5.2 as well as theorem 5.1 also hold for more general demand functions $D(p, r, \epsilon)$, which are decreasing in price $p$, non-decreasing in reference price $r$ and concave in both $p$ and $r$. The demand function defined in (4.2.1) of course fulfills these assumptions.

[Figure: optimal price $p^*$ plotted against inventory before ordering $x$ and reference price $r$, with the base-stock level $S^*(r)$ marked.]

Figure 5.1.: List-price policy in reference price


Figure 5.1 and figure 5.2 give a graphical description of the policy, which we showed to be optimal in the above theorem. They are an extension of figure 2.1 and figure 2.7 in chapter 2, with the difference that now the optimal decisions depend on two states: the inventory before ordering $x$ and the reference price $r$. Note that the dependency of the optimal price $p^*$ and the optimal inventory level after ordering $y^*$ on the reference price $r$ adds an additional dimension to the solution space. It becomes clear that, in contrast to chapter 2, the base-stock level $S^*(r)$ and the optimal price $p^*(x, r)$ depend on the consumers' price expectation $r$. Looking at figures 5.1 and 5.2, the question arises whether new structural properties of the optimal policies in the reference price $r$ can be formulated. This leads to the following theorem, where for the one-period case and loss-neutral customer behavior we show that both the optimal pricing and ordering policy are non-decreasing in the reference price $r$.


To prove theorem 5.2 we introduce the theory of implicit differentiation (Heuser 1981, theorem 170.1) in the lemma below.

Lemma 5.3 (Implicit differentiation). Let $G \subset \mathbb{R}^n$ and $H \subset \mathbb{R}^m$ be nonempty open sets and $\xi \in G$ and $\eta \in H$. Furthermore, let $F : G \times H \to \mathbb{R}^m$ be a continuous function with $F(\xi, \eta) = 0$, such that $F'(\xi, \eta)$ is well defined and $\frac{\partial F}{\partial y}(\xi, \eta)$ is invertible. If there exists a $\delta$-neighborhood $U \subset G$ of $\xi$, an $\epsilon$-neighborhood $V \subset H$ of $\eta$ and a continuous function $f : U \to V$ with $f(\xi) = \eta$ and $F(x, f(x)) = 0$ for all $x \in U$, then $f$ is differentiable at $\xi$ and $f'(\xi)$ is given by

$$f'(\xi) = -\left( \frac{\partial F}{\partial y}(\xi, \eta) \right)^{-1} \frac{\partial F}{\partial x}(\xi, \eta). \tag{5.1.27}$$

[Figure: inventory after ordering $y^*$ plotted against inventory before ordering $x$ and reference price $r$, with the base-stock level $S^*(r)$ marked.]

Figure 5.2.: Base-stock policy in reference price


Theorem 5.2. For the linear demand function (5.0.1) and the system defined in (4.3.1) to (4.3.3), both the optimal base-stock level $S^*(p^*, r)$ and the optimal pricing policy $p^*(x, r)$ are non-decreasing in the reference price $r$.

Proof. Since for the linear demand function (4.2.1) the second and mixed partial derivatives of demand $D(p, r, \epsilon)$ are zero ($D_{pp}(p, r, \epsilon) = 0$, $D_{rr}(p, r, \epsilon) = 0$, $D_{pr}(p, r, \epsilon) = 0$), we obtain by lemma 5.3 the partial derivative of the optimal price $p^*(x, r)$:

$$\frac{\partial p^*(x, r)}{\partial r} = \begin{cases} -\dfrac{E[D_r]}{2 E[D_p]}, & x \le S^*(P^*(r), r) \\[2ex] -\dfrac{E[D_r] - (h + b) E[D_p] E[D_r] F'(x - E[D])}{2 E[D_p] - (h + b) E[D_p]^2 F'(x - E[D])}, & x > S^*(P^*(r), r), \end{cases} \tag{5.1.28}$$

where we use the short notation $D = D(p^*(x, r), r, \epsilon)$, $D_p = D_p(p^*(x, r), r, \epsilon)$ and $D_r = D_r(p^*(x, r), r, \epsilon)$. Since $D_p(p, r, \epsilon) < 0$ and $D_r(p, r, \epsilon) > 0$ for all $p$ and $r$, it is easy to see that equation (5.1.28) is always greater than zero and thus the optimal price $p^*(x, r)$ is non-decreasing in the reference price $r$. From equations (5.1.16) and (5.1.28) it follows that the base-stock level $S^*(p^*, r)$ is also non-decreasing in $r$:

$$\frac{\partial S^*}{\partial r}(P^*(r), r) = -E[D_p(P^*(r), r, \epsilon)] \frac{E[D_r(P^*(r), r, \epsilon)]}{2 E[D_p(P^*(r), r, \epsilon)]} + E[D_r(P^*(r), r, \epsilon)] = \frac{E[D_r(P^*(r), r, \epsilon)]}{2} \ge 0. \tag{5.1.29}$$

□
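The monotonicity statements for the discounted price can be checked numerically by solving the first-order condition (5.1.18) with bisection. This is a sketch under stated assumptions: a zero-mean normal demand shock, the linear demand function (5.0.1), hypothetical default parameter values, and a price bracket `[lo, hi]` wide enough to contain the root.

```python
from statistics import NormalDist


def markdown_price(x, r, c=5.0, h=1.0, b=4.0, gamma=0.9,
                   beta0=100.0, beta1=-2.0, beta2=-1.0, sigma=10.0,
                   lo=0.0, hi=200.0, tol=1e-10):
    """Root of the first-order condition (5.1.18) in p, found by bisection."""
    cdf = NormalDist(0.0, sigma).cdf
    slope = beta1 + beta2                      # E[D_p] for linear demand

    def foc(p):
        mean_d = beta0 + beta1 * p + beta2 * (p - r)
        return mean_d + (p - gamma * c - b) * slope + (h + b) * slope * cdf(x - mean_d)

    # foc is strictly decreasing in p because J is strictly concave in p.
    if foc(lo) <= 0.0 or foc(hi) >= 0.0:
        raise ValueError("price bracket [lo, hi] does not contain the root")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    for x in (60.0, 70.0, 80.0, 100.0):        # overstocked inventory levels
        print(f"x = {x:5.1f}: p* = {markdown_price(x, r=20.0):.4f}")
    for r in (15.0, 20.0, 25.0):               # varying reference price
        print(f"r = {r:5.1f}: p* = {markdown_price(80.0, r):.4f}")
```

With these illustrative parameters the computed price falls as the inventory before ordering rises and rises with the reference price, in line with theorems 5.1 and 5.2; at $x = S^*(P^*(r), r)$ the root coincides with the list-price $P^*(r)$.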


Remark 5.3. Theorem 5.2 shows that for loss-neutral customer behavior ($\beta_2 = \beta_3$), the optimal price $p^*(x, r)$ as well as the base-stock level $S^*(r)$ are non-decreasing in the reference price $r$. As illustrated in figure 5.3, this result can be extended to loss-averse customer behavior for the optimal price $p^*(x, r)$. However, the base-stock level $S^*(r)$ is only increasing in the reference price in the loss-neutral case. This is not true in the case of loss aversion, where the base-stock level $S^*(r)$ is not monotone in the reference price (see figure 5.4). While in the loss-neutral case the list-price $P^*(r)$ is a smooth (continuous) function of $r$, since expected demand is a smooth function of $p$ and $r$ due to $\beta_2 = \beta_3$, in the loss-averse case expected demand is a kinked function of $p$ and $r$ (with two different slopes depending on $p < r$ or $p > r$, respectively) and therefore the list-price $P^*(r)$ is a kinked function of $r$. Note that this results in three regimes for the list-price: $P^*(r) < r$, $P^*(r) = r$ and $P^*(r) > r$. For all reference prices $r$ with $P^*(r) = r$ the corresponding base-stock level $S^*(r, r)$ from equation (5.1.22) is decreasing, since

$$\frac{\partial S^*}{\partial r}(r, r) = \beta_1 < 0, \tag{5.1.30}$$

and thus an optimal inventory policy is no longer monotone in the reference price $r$ for loss-averse customer behavior (see figure 5.4).

[Figure: list-price $P^*$ plotted against reference price $r$ for the loss-neutral and the loss-averse case, with the line $P^*(r) = r$ marked.]

Figure 5.3.: List-price in reference price


From the above remark it becomes clear that loss-averse customer behavior already adds considerable complexity to the model in the one-period case and thus significant additional dynamics in the multi-period setting. In this thesis we mainly concentrate on the loss-neutral case.

[Figure: base-stock level $S^*$ plotted against reference price $r$ for the loss-neutral and the loss-averse case.]

Figure 5.4.: Base-stock in reference price


5.2. Two-period model 


We are now in the position to add one period to the planning horizon and study the two-period case. This is not only a theoretical exercise but also has practical relevance of its own: due to shortening life cycles, an increasing fraction of a retailer's assortment consists of products for which, because of long production lead times, there is only one possibility for reordering after the initial order is placed. This motivates the application of a two-period model.

The dynamic program defined in (4.3.1) to (4.3.3) will be used in this section for $t = 1, 2$. Corresponding to (4.3.4) and (4.3.5) the transition functions are given by

$$x_2 = x_2(y_1, p_1, r_1, \epsilon_1) = y_1 - D_1(p_1, r_1, \epsilon_1), \tag{5.2.1}$$

$$r_2 = r_2(p_1, r_1) = \alpha r_1 + (1 - \alpha) p_1, \tag{5.2.2}$$

where the subscript '1' denotes the first time period and '2' the second time period. Thus $v_2(x_2, r_2)$ is rewritten as $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$.

In Federgruen and Heching (1999), joint concavity of the value function $J_t(x_t, y_t, p_t)$ is used to show the optimality of a base-stock policy. However, this approach does not work in the case of reference prices. In this case, $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is not necessarily concave in $p_1$ and therefore we cannot conclude without further investigation that $J_1(x_1, y_1, p_1)$ is jointly concave in $y_1$ and $p_1$.


Lemma 5.4. For the linear demand function (5.0.1), the system defined in (4.3.1) to (4.3.3) and $T = 2$, the value function of the second time period $v_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is not necessarily concave in the selling price of the first time period $p_1$.

Proof. Since $J_2(x_2, y_2, p_2, r_2) = E[\Pi_2(x_2, y_2, p_2, r_2, \epsilon_2)] + \gamma c(y_2 - E[D(p_2, r_2, \epsilon_2)])$, the expected profit of the last time period $J_2(x_2, y_2, p_2, r_2)$ is jointly concave in inventory level $y_2$ and price $p_2$ by lemma 5.1 and thus a base-stock policy is optimal (see theorem 5.1). Hence, the last period's optimal profit can be rewritten by substituting $h$ by $h - \gamma c$ and $b$ by $b + \gamma c$:

$$v_2^*(x_2, r_2) = \begin{cases} V_2^*(r_2), & x_2 \le S_2^*(r_2) \\ \vartheta_2^*(x_2, r_2), & \text{else} \end{cases} = \begin{cases} E[\Pi_2(x_2, S^*(r_2), P^*(r_2), r_2, \epsilon_2)], & x_2 \le S_2^*(r_2) \\ E[\Pi_2(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)], & \text{else.} \end{cases} \tag{5.2.3}$$

Substituting the transition functions (5.2.1) and (5.2.2), the optimal value function becomes $v_2^*(x_2, r_2) = v_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$. Note that the holding costs in the second time period are given by $h - \gamma c$, and the backlogging costs by $b + \gamma c$, respectively (compare page 22). Thus by substituting $h$ by $h - \gamma c$ and $b$ by $b + \gamma c$, the total optimal profit $v_2^*(x_2, r_2)$ is given by $E[\Pi_2(x_2, y^*(x_2, r_2), p_2^*(x_2, r_2), r_2(p_1, r_1), \epsilon_2)]$.

Although $V_2^*(r_2) = E[\Pi_2(x_2, S^*(r_2), P^*(r_2), r_2, \epsilon_2)]$ is concave in $p_1$ by lemma A.2 in appendix A, this is not true for $\vartheta_2^*(x_2, r_2) = E[\Pi_2(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]$. Lemma A.3 shows that $\partial^2 \vartheta_2^*(x_2, r_2) / \partial p_1^2$ is not necessarily less than zero. Hence, concavity in $p_1$ is not guaranteed for $\vartheta_2^*(x_2, r_2)$ and thus we cannot conclude that the optimal profit of the second time period $v_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is concave in $p_1$. □

[Figure: for each of the three cases of a positive, zero and negative base-stock level, the ordering of the switching points $y_1 - E[D] - S^*$ and $y_1 - E[D]$ on the $\epsilon_1$ axis and the combination of first-period and second-period profit functions that applies in each interval.]

Figure 5.5.: Switches between functions $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ and $v_2(x_2, r_2)$


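In order to prove that a base-stock policy is still optimal in the first time period, we will show that $J_1(y_1, p_1, x_1, r_1)$ is jointly concave in $y_1$ and $p_1$, although $v_2(x_2, r_2)$ is not jointly concave in $y_1$ and $p_1$ (see lemma 5.4). Using equation (4.3.2), the expected profit of the first period can be written as

$$J_1(x_1, y_1, p_1, r_1) = E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)] + \gamma E[v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))].$$

By using the short notation $E[D(p_1, r_1, \epsilon_1)] = E[D]$, the profit of the first time period then becomes

$$\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1) = \begin{cases} \Pi_1^A(x_1, y_1, p_1, r_1, \epsilon_1), & \epsilon_1 \le y_1 - E[D] \\ \Pi_1^B(x_1, y_1, p_1, r_1, \epsilon_1), & \epsilon_1 \ge y_1 - E[D] \end{cases} = \begin{cases} p_1(E[D] + \epsilon_1) - c(y_1 - x_1) - h(y_1 - E[D] - \epsilon_1), & \epsilon_1 \le y_1 - E[D] \\ p_1(E[D] + \epsilon_1) - c(y_1 - x_1) - b(E[D] + \epsilon_1 - y_1), & \epsilon_1 \ge y_1 - E[D]. \end{cases} \tag{5.2.4}$$

Similarly we distinguish between the possible realizations of $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$:

$$v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) = \begin{cases} V_2^*(r_2), & \epsilon_1 \ge y_1 - E[D] - S^*(r_2) \\ \vartheta_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1)), & \epsilon_1 < y_1 - E[D] - S^*(r_2) \end{cases} = \begin{cases} J_2(x_2, S^*(r_2), P^*(r_2), r_2), & \epsilon_1 \ge y_1 - E[D] - S^*(r_2) \\ J_2(x_2, x_2, p_2^*(x_2, r_2), r_2), & \epsilon_1 < y_1 - E[D] - S^*(r_2). \end{cases} \tag{5.2.5}$$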


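We now exchange the order of summation and taking expected values such that

$$J_1(x_1, y_1, p_1, r_1) = E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1) + \gamma v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))] = \int_{-\infty}^{\infty} \left( \begin{cases} \Pi_1^A(x_1, y_1, p_1, r_1, u_1), & u_1 \le y_1 - E[D] \\ \Pi_1^B(x_1, y_1, p_1, r_1, u_1), & u_1 > y_1 - E[D] \end{cases} + \gamma \begin{cases} V_2^*(r_2(p_1, r_1)), & u_1 \ge y_1 - E[D] - S^*(r_2) \\ \vartheta_2^*(x_2(y_1, p_1, r_1, u_1), r_2(p_1, r_1)), & u_1 < y_1 - E[D] - S^*(r_2) \end{cases} \right) f(u_1)\,du_1. \tag{5.2.6}$$

From the formula above it is easy to see that we need to distinguish between the three cases $S^*(r_2) > 0$, $S^*(r_2) = 0$ and $S^*(r_2) < 0$ (see figure 5.5). Let $S^*(r_2) > 0$, then $J_1(x_1, y_1, p_1, r_1)$ becomes

$$\begin{aligned}
J_1(x_1, y_1, p_1, r_1) &= \int_{-\infty}^{y_1 - E[D] - S^*(r_2)} \big( \Pi_1^A(x_1, y_1, p_1, r_1, u_1) + \gamma \vartheta_2^*(x_2(y_1, p_1, r_1, u_1), r_2(p_1, r_1)) \big) f(u_1)\,du_1 \\
&\quad + \int_{y_1 - E[D] - S^*(r_2)}^{y_1 - E[D]} \big( \Pi_1^A(x_1, y_1, p_1, r_1, u_1) + \gamma V_2^*(r_2(p_1, r_1)) \big) f(u_1)\,du_1 \\
&\quad + \int_{y_1 - E[D]}^{\infty} \big( \Pi_1^B(x_1, y_1, p_1, r_1, u_1) + \gamma V_2^*(r_2(p_1, r_1)) \big) f(u_1)\,du_1.
\end{aligned} \tag{5.2.7}$$

For $S^*(r_2) = 0$ the profit function $J_1(x_1, y_1, p_1, r_1)$ can be written as

$$\begin{aligned}
J_1(x_1, y_1, p_1, r_1) &= \int_{-\infty}^{y_1 - E[D]} \big( \Pi_1^A(x_1, y_1, p_1, r_1, u_1) + \gamma \vartheta_2^*(x_2(y_1, p_1, r_1, u_1), r_2(p_1, r_1)) \big) f(u_1)\,du_1 \\
&\quad + \int_{y_1 - E[D]}^{\infty} \big( \Pi_1^B(x_1, y_1, p_1, r_1, u_1) + \gamma V_2^*(r_2(p_1, r_1)) \big) f(u_1)\,du_1.
\end{aligned} \tag{5.2.8}$$

We now consider the case $S^*(r_2) < 0$, which yields

$$\begin{aligned}
J_1(x_1, y_1, p_1, r_1) &= \int_{-\infty}^{y_1 - E[D]} \big( \Pi_1^A(x_1, y_1, p_1, r_1, u_1) + \gamma \vartheta_2^*(x_2(y_1, p_1, r_1, u_1), r_2(p_1, r_1)) \big) f(u_1)\,du_1 \\
&\quad + \int_{y_1 - E[D]}^{y_1 - E[D] - S^*(r_2)} \big( \Pi_1^B(x_1, y_1, p_1, r_1, u_1) + \gamma \vartheta_2^*(x_2(y_1, p_1, r_1, u_1), r_2(p_1, r_1)) \big) f(u_1)\,du_1 \\
&\quad + \int_{y_1 - E[D] - S^*(r_2)}^{\infty} \big( \Pi_1^B(x_1, y_1, p_1, r_1, u_1) + \gamma V_2^*(r_2(p_1, r_1)) \big) f(u_1)\,du_1.
\end{aligned} \tag{5.2.9}$$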


We will now show that the four possible summands are jointly concave in $y_1$ and $p_1$.

Lemma 5.5. For the linear demand function defined in equation (5.0.1), the system (4.3.1) to (4.3.3) and a two-period setting $T = 2$, each of the functions

$$\Pi_1^A(x_1, y_1, p_1, r_1, \epsilon_1) + \gamma V_2^*(r_2(p_1, r_1)), \tag{5.2.10}$$

$$\Pi_1^B(x_1, y_1, p_1, r_1, \epsilon_1) + \gamma V_2^*(r_2(p_1, r_1)), \tag{5.2.11}$$

$$\Pi_1^A(x_1, y_1, p_1, r_1, \epsilon_1) + \gamma \vartheta_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1)), \tag{5.2.12}$$

$$\Pi_1^B(x_1, y_1, p_1, r_1, \epsilon_1) + \gamma \vartheta_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) \tag{5.2.13}$$

is jointly concave in $y_1$ and $p_1$.


Proof. The first two functions are trivially jointly concave in $y_1$ and $p_1$, since the two possible realizations of the profit function in the first time period, $\Pi_1^A(x_1, y_1, p_1, r_1, \epsilon_1)$ and $\Pi_1^B(x_1, y_1, p_1, r_1, \epsilon_1)$, are jointly concave by lemma A.1 in appendix A and $V_2^*(r_2(p_1, r_1))$ is jointly concave by lemma A.2 with a discount factor $\gamma > 0$.

The situation is not so clear for the second two functions, since the expected optimal profit $\vartheta_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is not necessarily concave in $p_1$ by equation (A.10) in lemma A.3 and thus $\vartheta_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is not jointly concave in $y_1$ and $p_1$. However, in the following we will show that the joint concavity of the first time period's profits $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ is strong enough to dominate the non-concavity of the future profit. It is easy to see that the 'misbehavior' of non-concavity will be worse, the larger the discount factor $\gamma$ is. Hence, without loss of generality, we examine the case of no discount ($\gamma = 1$) in the following analysis.

As a first step we will now show that $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1) + \vartheta_2^*(x_2, r_2)$ is concave in the selling price $p_1$. By using lemma A.1 and lemma A.3 from appendix A, as well as the assumptions that $\beta_1, \beta_2 < 0$ and $0 \le \alpha < 1$, we can calculate the following for the linear demand function (5.0.1):


a" (Il (21, Y1: pi Ti. 6€) + 905 (xo(yi pi 1.6). T2(P1-71))) = 
Op; 
| .2(28i + (1 — @)@2)[(h + b)(8y — 83)* + ((a-3) Bo — 20)/ (2f (z2 — E[D(pa. re. enl 


(Bı + 82)(2/ f(z2 — E[D(pa. r2. €2)]) — (h + b)(61 + 82)) 
(5.2.14) 


which is less than or equal to zero and thus proves concavity in $p_1$. It remains to show joint concavity in $y_1$ and $p_1$. For this purpose we evaluate, again using lemmas A.1 and A.3, the determinant of the Hesse matrix of $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1) + \vartheta_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ for the linear demand function (5.0.1), where $H[\cdot]$ denotes the Hesse matrix:


det (H [I (zi. y1; $1.73. 6))] + H [V}(T2(y1. pi. T1: €1)-Te(pi-71))]) = 


(emu. I (zo(y:. py. T1. €1), ro(pi. r1) J 
Opt Op; 
(Sef pete o? 985(xo(y1. Pı: T1- €1). T2(p1. T1) ')- 
erg + t—— Bean ea SE FE RB Be 
Oy; Oy; 
EE 3 0° 985(ro(yi. P1- r1. €1). T2(P1; T1 > 
Oy Op Oy Op 
_ (8(5 + b2)? — (1 — a)? b2 )(h +b) > ( 
(81 + 83)(2/ f(zo — ElD(p2.r2)]) — (h + b)(Aı + Be)) ~ 


(5.2.15) 


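The above equation shows that the determinant of the Hesse matrix of $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1) + \vartheta_2^*(x_2, r_2)$ is non-negative. Together with the concavity in $p_1$ shown above, the Hesse matrix is negative semi-definite, which yields joint concavity in the decision variables $y_1$ and $p_1$ and proves the lemma. □

It now remains to be shown that the profit functions of the first and second time period, respectively, are continuous and concave in their points of non-differentiability.

Lemma 5.6. The two functions $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ and $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ defined piecewise in equations (5.2.4) and (5.2.5) are continuous and concave in $y_1$ and $p_1$ in their points of non-differentiability.

Proof. To prove lemma 5.6 it suffices to show the following properties for any value of $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$:

$$\begin{aligned}
\lim_{\delta \to 0} \big[ \Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1) - \Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1) \big] &= 0, \\
\lim_{\delta \to 0} \big[ \Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1) - \Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1) \big] &= 0,
\end{aligned} \tag{5.2.16}$$

$$\begin{aligned}
\lim_{\delta \to 0} \left[ \frac{\partial \Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1)}{\partial p_1} - \frac{\partial \Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1)}{\partial p_1} \right] &\ge 0, \\
\lim_{\delta \to 0} \left[ \frac{\partial \Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1)}{\partial y_1} - \frac{\partial \Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1)}{\partial y_1} \right] &\ge 0.
\end{aligned} \tag{5.2.17}$$

Furthermore, we need to show for any value of $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)] - S^*(r_2(p_1, r_1))$ that the following equations hold true:

$$\begin{aligned}
\lim_{\delta \to 0} \big[ v_2(x_2(y_1, p_1 - \delta, r_1, \epsilon_1), r_2(p_1 - \delta, r_1)) - v_2(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) \big] &= 0, \\
\lim_{\delta \to 0} \big[ v_2(x_2(y_1 - \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) - v_2(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) \big] &= 0,
\end{aligned} \tag{5.2.18}$$

$$\begin{aligned}
\lim_{\delta \to 0} \left[ \frac{\partial v_2(x_2(y_1, p_1 - \delta, r_1, \epsilon_1), r_2(p_1 - \delta, r_1))}{\partial p_1} - \frac{\partial v_2(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1))}{\partial p_1} \right] &= 0, \\
\lim_{\delta \to 0} \left[ \frac{\partial v_2(x_2(y_1 - \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1))}{\partial y_1} - \frac{\partial v_2(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1))}{\partial y_1} \right] &= 0.
\end{aligned} \tag{5.2.19}$$

Equations (5.2.16) and (5.2.18) show the functions' continuity and equations (5.2.17) and (5.2.19) the functions' concavity in $y_1$ and $p_1$. Lemma A.5 and lemma A.6 in appendix A show that the above equations indeed hold. □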


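We will now show that the function $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ is jointly concave in $y_1$ and $p_1$ at the kink $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$.

Lemma 5.7. The profit function of the first time period $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ defined piecewise in equation (5.2.4) is jointly concave in $y_1$ and $p_1$.

Proof. We know by lemma A.1 in appendix A that the functions $\Pi_1^A(x_1, y_1, p_1, r_1, \epsilon_1)$ and $\Pi_1^B(x_1, y_1, p_1, r_1, \epsilon_1)$ are both jointly concave in $y_1$ and $p_1$. It now remains to be shown that $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ is jointly concave at the kink $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$. For convenience we reformulate equation (5.2.4) in the following way:

$$\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1) = p_1\big(E[D(p_1, r_1, \epsilon_1)] + \epsilon_1\big) - c(y_1 - x_1) - g(y_1, p_1, r_1, \epsilon_1) \tag{5.2.20}$$

with

$$g(y_1, p_1, r_1, \epsilon_1) = \begin{cases} h\big(y_1 - E[D(p_1, r_1, \epsilon_1)] - \epsilon_1\big), & \epsilon_1 \le y_1 - E[D(p_1, r_1, \epsilon_1)] \\ -b\big(y_1 - E[D(p_1, r_1, \epsilon_1)] - \epsilon_1\big), & \epsilon_1 \ge y_1 - E[D(p_1, r_1, \epsilon_1)]. \end{cases} \tag{5.2.21}$$

It is clear that $p_1(E[D(p_1, r_1, \epsilon_1)] + \epsilon_1) - c(y_1 - x_1)$ is jointly concave in $y_1$ and $p_1$. Let $(\hat{y}_1, \hat{p}_1)$ be a point with $\hat{y}_1 = E[D(\hat{p}_1, r_1, \epsilon_1)] + \epsilon_1$ and $(\tilde{y}_1, \tilde{p}_1)$ a point with $\tilde{y}_1 \ne E[D(\tilde{p}_1, r_1, \epsilon_1)] + \epsilon_1$. It is easy to see from equation (5.2.21) and $b, h > 0$ that

$$g(\hat{y}_1, \hat{p}_1, r_1, \epsilon_1) < g(\tilde{y}_1, \tilde{p}_1, r_1, \epsilon_1). \tag{5.2.22}$$

Since $y_1 - E[D(p_1, r_1, \epsilon_1)] - \epsilon_1$ is linear in $y_1$, it follows directly that $g(y_1, p_1, r_1, \epsilon_1)$ is jointly convex and thus $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ is jointly concave in $y_1$ and $p_1$ at the kink $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$ and therefore everywhere. □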


We are now ready to show the optimality of a base-stock-list-price policy, for which we 
introduce another useful lemma, which can be found in Heuser (1981). 


Lemma 5.8 (Heuser (1981), theorem 166.1). Let the function f : G C R” ^R (G being 
an open set) be differentiable in G’. Then the derivative in direction a = (a4, ... , an) exists 
and is given by 


Ofte) S> A0) 


da er 


(5.2.23) 


i=1 


Theorem 5.3 (Base-stock-list-price policy). For the linear demand function defined in equa- 
tion (5.0.1) and the system (4.3.1) to (4.3.3) , the following holds for t = 1.2 of a two-period 
setting T = 2: 


1. A base-stock-list-price policy is optimal. 
2. The profit function Ji(Yi, Pi. x4. Ti) is jointly concave in y, and p,. 


3. The profit function Ji (yi. pi, 24, 74) is submodular in y, and p. 


¹ A function $f$ is differentiable in $G \subset \mathbb{R}^n$ if the partial derivatives of $f$ with respect to $x_1, \ldots, x_n$ exist for any point $\xi \in G$.

Proof. The above statement is true for the second time period $t = 2$ by theorem 5.1. It remains to be shown that $J_1(x_1, y_1, p_1, r_1)$ is jointly concave and submodular in $y_1$ and $p_1$.

In lemma 5.5 we showed that the functions defined in equations (5.2.10) to (5.2.13) are each jointly concave in $y_1$ and $p_1$. We now need to show that the junctions of those functions are indeed jointly concave. It is convenient that we already know from equations (A.20) and (A.21) in lemma A.6 in appendix A that when the branch of $v_2$ depending only on $r_2(p_1, r_1)$ switches to the branch depending on $x_2(y_1, p_1, r_1, \epsilon_1)$ and $r_2(p_1, r_1)$, the difference of its slopes with respect to $y_1$ and $p_1$ is zero. Thus, $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ being continuous, it is also differentiable with respect to $y_1$ and $p_1$. By lemma 5.8 it follows that $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is differentiable in any direction $a = a_1 y_1 + a_2 p_1$ and thus is a smooth function. Therefore the problem of showing that the junctions of (5.2.10) to (5.2.13) are jointly concave reduces to the verification that the junction of the two piecewise functions


IP(z,yipi,ri60) + vyvo(xo(yi pi r1, €1), r2(91, 71)) (5.2.24) 
IÈ (£1,Y1, P1,161) + Yurlzalyı, pi, ri €1), 2(p i) (5.2.23) 


is indeed jointly concave, which follows from lemma 5.7. 


It now remains to be shown that $J_1(x_1, y_1, p_1, r_1)$ is submodular in $y_1$ and $p_1$ and thus $\partial^2 J_1(x_1, y_1, p_1, r_1)/(\partial y_1 \partial p_1) \le 0$. We have $\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)/(\partial y_1 \partial p_1) = 0$ by equation (A.3) in lemma A.1, $\partial^2 v_2(r_2(p_1, r_1))/(\partial y_1 \partial p_1) = 0$ for the branch of $v_2$ depending only on $r_2$ by equation (A.7) in lemma A.2, and $\partial^2 v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))/(\partial y_1 \partial p_1) \le 0$ for the branch depending on $x_2$ and $r_2$ by equation (A.12) in lemma A.3. Since submodularity is maintained by integration, it is clear that $J_1(x_1, y_1, p_1, r_1) = E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)] + \gamma v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ is submodular in $y_1$ and $p_1$. By theorem 8-4 in Heyman and Sobel (2004) submodularity in $y_1$ and $p_1$ suffices to show that the optimal price $p_1^*(x_1, r_1)$ is non-increasing in $x_1$ and a list-price policy is optimal. □


5.3. Multi-period model 


Under some restrictive assumptions we are able to extend the base-stock property obtained above to the multi-period case. To simplify the proofs we now consider the transformed model given by equations (4.3.7) to (4.3.9) in chapter 4. Furthermore, we introduce the following assumptions:


Assumption 5.1. In each time period $t = 1, \ldots, T$ the following holds: The demand function $D_t(p, r, \epsilon)$ is non-increasing in $p$, non-decreasing in $r$ and jointly concave in $p$ and $r$, while the revenues $p D_t(p, r, \epsilon)$ are assumed to be jointly concave in $p$ and $r$. Furthermore, $G_t(y, p, r)$ is assumed to be jointly convex in $y$, $p$ and $r$.


Lemma 5.9. The expected profit-to-go function $V_t(x, r)$ is non-increasing in $x$ for all $r$ and $t = 1, \ldots, T$.


Proof. Let $x_1 < x_2$. Then $V_t(x_1, r) = \max_{y \ge x_1,\, p} J_t(y, p, r) \ge \max_{y \ge x_2,\, p} J_t(y, p, r) = V_t(x_2, r)$, since the feasible set of the second maximization is contained in that of the first. □

Theorem 5.4 (Base-stock policy). For the system (4.3.7) to (4.3.9) and assumption 5.1, the following holds for any time period $t = 1, \ldots, T$:

1. $J_t(y, p, r)$ is jointly concave in $y$, $p$ and $r$.

2. $V_t(x, r)$ is jointly concave in $x$ and $r$.

3. A base-stock policy with order-up-to level $S_t^*(x, r)$ is optimal in time period $t$.
Proof. The proof technique is similar to the one suggested in theorem 2.5 and is conducted by induction. $V_{T+1}(x, r) = 0$ is trivially jointly concave in $x$ and $r$ and thus $J_T(y, p, r)$ is jointly concave in $y$, $p$ and $r$ by assumption 5.1. We now assume that $V_{t+1}(x, r)$ is jointly concave in $x$ and $r$ and $J_{t+1}(y, p, r)$ is jointly concave in $y$, $p$ and $r$.


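By defining $\tilde r(p, r)$ and $\tilde x(y, p, r, \epsilon)$ as

\[ \tilde r(p, r) = \alpha r + (1 - \alpha) p, \tag{5.3.1} \]
\[ \tilde x(y, p, r, \epsilon) = y - D_t(p, r, \epsilon), \tag{5.3.2} \]

the following holds for any pair $(y_1, y_2)$, $(p_1, p_2)$ and $(r_1, r_2)$:

\[
\begin{aligned}
\tilde x\left(\tfrac{y_1 + y_2}{2}, \tfrac{p_1 + p_2}{2}, \tfrac{r_1 + r_2}{2}, \epsilon\right)
&= \tfrac{y_1 + y_2}{2} - D_t\left(\tfrac{p_1 + p_2}{2}, \tfrac{r_1 + r_2}{2}, \epsilon\right) \\
&\le \tfrac{y_1 + y_2}{2} - \tfrac{1}{2}\left(D_t(p_1, r_1, \epsilon) + D_t(p_2, r_2, \epsilon)\right) \\
&= \tfrac{1}{2}\left(y_1 - D_t(p_1, r_1, \epsilon)\right) + \tfrac{1}{2}\left(y_2 - D_t(p_2, r_2, \epsilon)\right) \\
&= \tfrac{1}{2}\tilde x(y_1, p_1, r_1, \epsilon) + \tfrac{1}{2}\tilde x(y_2, p_2, r_2, \epsilon),
\end{aligned}
\tag{5.3.3}
\]

since $D_t(p, r, \epsilon)$ is jointly concave in $p$ and $r$ by assumption 5.1. Thus we obtain

\[
\begin{aligned}
& V_{t+1}\left(\tilde x\left(\tfrac{y_1 + y_2}{2}, \tfrac{p_1 + p_2}{2}, \tfrac{r_1 + r_2}{2}, \epsilon\right), \tilde r\left(\tfrac{p_1 + p_2}{2}, \tfrac{r_1 + r_2}{2}\right)\right) \\
&\quad \ge V_{t+1}\left(\tfrac{1}{2}\tilde x(y_1, p_1, r_1, \epsilon) + \tfrac{1}{2}\tilde x(y_2, p_2, r_2, \epsilon), \tilde r\left(\tfrac{p_1 + p_2}{2}, \tfrac{r_1 + r_2}{2}\right)\right) \\
&\quad = V_{t+1}\left(\tfrac{1}{2}\tilde x(y_1, p_1, r_1, \epsilon) + \tfrac{1}{2}\tilde x(y_2, p_2, r_2, \epsilon), \tfrac{1}{2}\tilde r(p_1, r_1) + \tfrac{1}{2}\tilde r(p_2, r_2)\right) \\
&\quad \ge \tfrac{1}{2} V_{t+1}\left(\tilde x(y_1, p_1, r_1, \epsilon), \tilde r(p_1, r_1)\right) + \tfrac{1}{2} V_{t+1}\left(\tilde x(y_2, p_2, r_2, \epsilon), \tilde r(p_2, r_2)\right).
\end{aligned}
\tag{5.3.4}
\]

The first inequality of (5.3.4) holds due to lemma 5.9, and the subsequent equality holds since $\tilde r\left(\tfrac{p_1 + p_2}{2}, \tfrac{r_1 + r_2}{2}\right) = \tfrac{1}{2}\tilde r(p_1, r_1) + \tfrac{1}{2}\tilde r(p_2, r_2)$. The second inequality of (5.3.4) follows from $V_{t+1}(x, r)$ being jointly concave in $x$ and $r$ by the induction assumption. It is clear that equation (5.3.4) guarantees that $V_{t+1}(\tilde x(y, p, r, \epsilon), \tilde r(p, r))$ and thus $E[V_{t+1}(x, r)]$ is jointly concave in $p$, $y$ and $r$. Since the first three terms of (4.3.8) are jointly concave in $y$, $p$ and $r$ by assumption 5.1, $J_t(y, p, r)$ is jointly concave in $y$, $p$ and $r$.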

We now show that $V_t(x, r)$ is jointly concave in $x$ and $r$. From (4.3.7) we know that

\[ V_t\left(\tfrac{x_1 + x_2}{2}, \tfrac{r_1 + r_2}{2}\right) = \max_{y \ge \frac{x_1 + x_2}{2},\, p} J_t\left(y, p, \tfrac{r_1 + r_2}{2}\right). \tag{5.3.5} \]

Since $J_t(y, p, r)$ is jointly concave in $y$ and $p$, a base-stock policy with a base-stock level $S^*(r)$ and an optimal price $p^*(x, r)$ is optimal and thus the optimal profit $V_t(x, r)$ can be written as

\[ V_t(x, r) = J_t\left(\max(S^*(r), x), p^*(x, r), r\right). \tag{5.3.6} \]


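It furthermore follows that

\[
\begin{aligned}
V_t\left(\tfrac{x_1 + x_2}{2}, \tfrac{r_1 + r_2}{2}\right)
&= \max_{y \ge \frac{x_1 + x_2}{2},\, p} J_t\left(y, p, \tfrac{r_1 + r_2}{2}\right) \\
&= J_t\left(\max\left(S^*\left(\tfrac{r_1 + r_2}{2}\right), \tfrac{x_1 + x_2}{2}\right), p^*\left(\tfrac{x_1 + x_2}{2}, \tfrac{r_1 + r_2}{2}\right), \tfrac{r_1 + r_2}{2}\right) \\
&\ge J_t\left(\tfrac{\max(S^*(r_1), x_1) + \max(S^*(r_2), x_2)}{2}, \tfrac{p^*(x_1, r_1) + p^*(x_2, r_2)}{2}, \tfrac{r_1 + r_2}{2}\right) \\
&\ge \tfrac{1}{2} J_t\left(\max(S^*(r_1), x_1), p^*(x_1, r_1), r_1\right) + \tfrac{1}{2} J_t\left(\max(S^*(r_2), x_2), p^*(x_2, r_2), r_2\right) \\
&= \tfrac{1}{2} V_t(x_1, r_1) + \tfrac{1}{2} V_t(x_2, r_2).
\end{aligned}
\tag{5.3.7}
\]

The first inequality of (5.3.7) holds since $p^*\left(\tfrac{x_1 + x_2}{2}, \tfrac{r_1 + r_2}{2}\right)$ and $\max\left(S^*\left(\tfrac{r_1 + r_2}{2}\right), \tfrac{x_1 + x_2}{2}\right)$ are the optimal decisions for $V_t\left(\tfrac{x_1 + x_2}{2}, \tfrac{r_1 + r_2}{2}\right)$. Any other feasible solution, in particular $\tfrac{p^*(x_1, r_1) + p^*(x_2, r_2)}{2}$ and $\tfrac{\max(S^*(r_1), x_1) + \max(S^*(r_2), x_2)}{2}$, thus yields a profit less than or equal to the optimal one. $J_t(y, p, r)$ being jointly concave in $y$, $p$ and $r$ explains the second inequality of (5.3.7). □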


Remark 5.4. Note that for the linear demand function defined in equation (5.0.1) the revenues $p D_t(p, r, \epsilon)$ are not jointly concave in $p$ and $r$ and thus assumption 5.1 does not hold.
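The failure of joint concavity noted in remark 5.4 can be checked numerically. The following is a minimal sketch, assuming that the loss-neutral linear demand of equation (5.0.1) has the form $D(p, r, \epsilon) = \beta_0 + \beta_1 p + \beta_2 (p - r) + \epsilon$ (equation (5.0.1) is not reproduced in this section, so this form and the parameter values are assumptions). The additive noise contributes only a term linear in $p$ and does not affect the Hessian.

```python
import numpy as np

# Assumed loss-neutral linear demand (noise omitted): D(p, r) = b0 + b1*p + b2*(p - r).
b0, b1, b2 = 100.0, -20.0, -40.0

# Revenue R(p, r) = p * D(p, r) = b0*p + (b1 + b2)*p**2 - b2*p*r is quadratic,
# so its Hessian with respect to (p, r) is constant.
hessian = np.array([[2.0 * (b1 + b2), -b2],
                    [-b2, 0.0]])

eigenvalues = np.linalg.eigvalsh(hessian)   # returned in ascending order
determinant = np.linalg.det(hessian)        # equals -b2**2 for this Hessian

# Joint concavity would require all eigenvalues to be non-positive.
is_jointly_concave = bool(np.all(eigenvalues <= 1e-12))
print(eigenvalues, determinant, is_jointly_concave)
```

Under this assumed demand form the determinant is $-\beta_2^2 < 0$ whenever $\beta_2 \neq 0$, so the Hessian is indefinite for any non-zero reference effect, not only for the values chosen here.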


Similar to sections 2.3.3 and 3.3 we can find a possible steady state for the integrated 
pricing and inventory control model given by equations (4.3.1) to (4.3.3). 


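Theorem 5.5. If the system (4.3.1) to (4.3.3) admits a steady state, then for the linear demand function defined in equation (5.0.1) and the loss-neutral case ($\beta_2 = \beta_3$) it is given by

\[ p_\infty^* = \frac{(c\beta_1 - \beta_0)(1 - \alpha\gamma) + c\beta_2(1 - \gamma)}{2\beta_1(1 - \alpha\gamma) + \beta_2(1 - \gamma)}, \tag{5.3.8} \]

\[ y_\infty^* = E[D(p_\infty^*, p_\infty^*, \epsilon)] + F^{-1}\left(\frac{b - c(1 - \gamma)}{b + h}\right). \tag{5.3.9} \]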


Proof. The proof is a combination of the proofs of theorem 2.9 in section 2.3.3 and theorem 3.1 in section 3.3. As in the proof for joint pricing and inventory control without reference effects, we first find the steady state inventory. Again the optimal total discounted profit for an infinite time horizon is given by (compare equation (4.2.2)):


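\[
\begin{aligned}
V(x_1, r_1) = c x_1 + \sum_{t=0}^{\infty} \gamma^t \Big[ & p_{t+1} E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})] - c(1 - \gamma) y_{t+1} \\
& - \gamma c E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})] - G(y_{t+1}, p_{t+1}, r_{t+1}) \Big],
\end{aligned}
\tag{5.3.10}
\]

where

\[
\begin{aligned}
G(y_{t+1}, p_{t+1}, r_{t+1}) ={}& h \int_{-\infty}^{y_{t+1} - E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})]} \left(y_{t+1} - E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})] - u\right) f(u)\, du \\
& + b \int_{y_{t+1} - E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})]}^{\infty} \left(u - y_{t+1} + E[D(p_{t+1}, r_{t+1}, \epsilon_{t+1})]\right) f(u)\, du.
\end{aligned}
\tag{5.3.11}
\]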


Differentiating with respect to $y$ and using the same arguments as in the proof of theorem 2.9, we get the steady state base-stock equation (5.3.9), which depends on the price. Note that the reference price has to be equal to the price, as this is necessary for a steady state price. The steady state price in formula (5.3.8) is found analogously to the proof of the steady state for the reference price model (see theorem 3.1) after substituting $y^*(p)$ for $y$ in the infinite sum (5.3.10) and differentiating with respect to the price $p$. This yields exactly the same steady state price $p_\infty^*$ as in theorem 3.1. □


Table 5.1.: Steady-state base stock and list price 


Remark 5.5. Note that the optimal steady state price $p_\infty^*$ in equation (5.3.8) is the same as the one obtained in theorem 3.1 in section 3.3, and the optimal steady state base-stock level $y_\infty^*$ in equation (5.3.9) corresponds to the steady state base-stock level obtained in theorem 2.9 in section 2.3.3. As in sections 2.3.3 and 3.3 one can study the behavior of the steady state solutions (see table 5.1) for the integrated model under consideration in this chapter.


Remark 5.6. From remark 5.5 it is clear that a sensitivity analysis of the steady state price $p_\infty^*$ of equation (5.3.8) is identical to the one described in theorem 3.3 in section 3.3 and thus the optimal steady state price $p_\infty^*$ is decreasing in the memory parameter $\alpha$, increasing in the discount factor $\gamma$ and decreasing in the reference effect $|\beta_2|$.


Since $F^{-1}\left(\frac{b - c(1 - \gamma)}{b + h}\right)$ is independent of both $\alpha$ and $|\beta_2|$, $p_\infty^*$ is decreasing in $\alpha$ and $|\beta_2|$, and expected demand is decreasing in price, it is clear that $E[D(p_\infty^*, p_\infty^*, \epsilon)]$ is increasing in $\alpha$ and $|\beta_2|$. As a consequence, $y_\infty^*$ is increasing in $\alpha$ and $|\beta_2|$. However, for the discount factor $\gamma$, we cannot make a definite statement about the behavior of the steady state base-stock $y_\infty^*$, as the safety stock is increasing in $\gamma$ while $E[D(p_\infty^*, p_\infty^*, \epsilon)]$ is decreasing in $\gamma$. As a result, in case of small uncertainty in demand the base-stock is decreasing in $\gamma$, whereas for large uncertainty the base-stock is increasing in $\gamma$. A numerical example is given in table 5.1.
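The steady state formulas and the sensitivity statements of remark 5.6 can be illustrated with a few lines of code. The sketch below assumes the closed forms $p_\infty^* = \frac{(c\beta_1 - \beta_0)(1 - \alpha\gamma) + c\beta_2(1 - \gamma)}{2\beta_1(1 - \alpha\gamma) + \beta_2(1 - \gamma)}$ and $y_\infty^* = E[D(p_\infty^*, p_\infty^*, \epsilon)] + F^{-1}\left(\frac{b - c(1 - \gamma)}{b + h}\right)$, a demand of the form $\beta_0 + \beta_1 p + \beta_2 (p - r) + \epsilon$, and normally distributed shocks. The function name `steady_state` and the parameter values (borrowed from the base scenario of section 6.1.1) are illustrative choices, not the values behind table 5.1.

```python
from statistics import NormalDist


def steady_state(b0, b1, b2, alpha, gamma, c, h, b, sigma):
    """Steady-state price and base-stock for normal shocks with std. dev. sigma."""
    k = 1.0 - alpha * gamma
    price = ((c * b1 - b0) * k + c * b2 * (1.0 - gamma)) / (
        2.0 * b1 * k + b2 * (1.0 - gamma))
    # In a steady state r = p, so the reference term b2 * (p - r) vanishes.
    expected_demand = b0 + b1 * price
    fractile = (b - c * (1.0 - gamma)) / (b + h)      # must lie in (0, 1)
    safety_stock = NormalDist(0.0, sigma).inv_cdf(fractile)
    return price, expected_demand + safety_stock


base = dict(b0=100.0, b1=-20.0, b2=-40.0, alpha=0.5, gamma=0.8,
            c=0.5, h=0.005, b=0.4, sigma=20.0)
p_star, y_star = steady_state(**base)
print(p_star, y_star)

# Sensitivity of the steady-state price to the memory parameter alpha.
prices_in_alpha = [steady_state(**{**base, "alpha": a})[0] for a in (0.0, 0.5, 1.0)]
# Sensitivity to the discount factor gamma and to the reference effect |b2|.
prices_in_gamma = [steady_state(**{**base, "gamma": g})[0] for g in (0.6, 0.8, 1.0)]
prices_in_ref = [steady_state(**{**base, "b2": v})[0] for v in (0.0, -20.0, -40.0)]
print(prices_in_alpha, prices_in_gamma, prices_in_ref)
```

For these illustrative values the computed prices fall in $\alpha$ and $|\beta_2|$ and rise in $\gamma$, in line with remark 5.6.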
6. Simulations and Numerical Investigations


The functional equations described in (4.3.1) to (4.3.3) and (4.3.7) to (4.3.9) in chapter 4 lack 
a closed-form solution, except in a very limited number of special cases, like the one-period 
case presented in section 5.1. Hence, the main focus of chapter 5 was on showing that an 
optimal solution theoretically exists and is unique, which then resulted in a simple optimal 
pricing and inventory control policy, like a base-stock-list-price policy. In this chapter we 
complement these analytical results with a numerical study. We shall relax some of the rather strong assumptions of chapter 5 in order to assess the robustness of its implications and give explicit optimal solutions. Furthermore, we investigate how the optimal inventory and pricing paths evolve over time and give a qualitative understanding of the obtained results. Moreover, we examine whether the model's solutions have some additional structural properties in the reference price.


We shall study a discrete time, discrete state Markov decision model, since the model 
under consideration is a periodic review model, and demand realization can only be integer 
values (we cannot sell a fraction of an item). We already described in chapter 4 that we are 
facing a two-dimensional state space (comprising an internal state describing the produc- 
tion system and an 'external' state related to the market) and a two-dimensional action space 
(reflecting both marketing and production/logistics decisions). Thus we are likely to run into the 'curse of dimensionality' (the tendency for the solution time to grow exponentially with the dimensionality of the state or action space). Handling it requires some theoretical insight into efficient programming, and the art lies in finding a trade-off between memory-intensive and run-time-intensive computational methods.


All results of this chapter are obtained by a dynamic program using backward recursion 
developed in the numerical computing environment MATLAB. The Compecon MATLAB 
toolbox, provided by Miranda and Fackler (2002) and especially designed to solve stochas- 
tic dynamic economic models, could not be used for the integrated model including reference 
price effects due to the memory intensive data structure of their algorithms for higher dimen- 
sional state spaces. Therefore we developed a new algorithm particularly suitable for our 
setting described in chapter 4. In order to guarantee numerical stability and control round- 
off errors, we implemented a spline interpolation on the value function with respect to the 
state and decision space, which also reduced computation times significantly. 
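The backward recursion described above can be sketched in a few lines. The following is not the MATLAB implementation used for the results of this chapter; it is a minimal Python/NumPy illustration under stated assumptions: the per-period profit is taken as $p D - c(y - x) - h (y - D)^+ - b (D - y)^+$ with $D = \max(\beta_0 + \beta_1 p + \beta_2 (p - r) + \epsilon, 0)$, unmet demand is backlogged, the reference price follows $r_{t+1} = \alpha r_t + (1 - \alpha) p_t$, the terminal value is zero, the value function is interpolated bilinearly (instead of by splines) and clamped at the grid boundaries, and the grids and the horizon `T = 3` are deliberately coarse. All names are hypothetical.

```python
import numpy as np

# Illustrative parameters, loosely following the base scenario of section 6.1.1.
b0, b1, b2 = 100.0, -20.0, -40.0          # intercept, price effect, reference effect
alpha, gamma = 0.5, 0.8                   # memory parameter, discount factor
c, h, b, sigma, T = 0.5, 0.005, 0.4, 20.0, 3

x_grid = np.arange(-100.0, 301.0, 10.0)   # inventory grid (before and after ordering)
r_grid = np.arange(1.0, 3.51, 0.25)       # reference-price grid, also used for prices
eps = sigma * np.linspace(-3.0, 3.0, 13)  # discretised demand shocks
w = np.exp(-0.5 * (eps / sigma) ** 2)
w /= w.sum()                              # normalised weights of the shocks


def interp2(V, x, r):
    """Bilinear interpolation of V on (x_grid, r_grid), clamped at the edges."""
    x = np.clip(x, x_grid[0], x_grid[-1])
    r = np.clip(r, r_grid[0], r_grid[-1])
    i = np.clip(np.searchsorted(x_grid, x) - 1, 0, len(x_grid) - 2)
    j = np.clip(np.searchsorted(r_grid, r) - 1, 0, len(r_grid) - 2)
    tx = (x - x_grid[i]) / (x_grid[i + 1] - x_grid[i])
    tr = (r - r_grid[j]) / (r_grid[j + 1] - r_grid[j])
    return ((1 - tx) * (1 - tr) * V[i, j] + tx * (1 - tr) * V[i + 1, j]
            + (1 - tx) * tr * V[i, j + 1] + tx * tr * V[i + 1, j + 1])


def backward_step(V_next):
    """One step of the recursion; returns V(x, r) and Q(y, p, r) (ordering cost c*y)."""
    y = x_grid[:, None, None, None]       # inventory level after ordering
    p = r_grid[None, :, None, None]       # price
    r = r_grid[None, None, :, None]       # reference price
    d = np.maximum(b0 + b1 * p + b2 * (p - r) + eps, 0.0)   # demand realisations
    left = y - d                          # next inventory (negative means backlog)
    cost = h * np.maximum(left, 0.0) + b * np.maximum(-left, 0.0)
    r_next = alpha * r + (1.0 - alpha) * p
    cont = interp2(V_next, left, r_next)
    Q = ((p * d - c * y - cost + gamma * cont) * w).sum(axis=-1)
    M = Q.max(axis=1)                     # best price for each (y, r)
    best = np.maximum.accumulate(M[::-1], axis=0)[::-1]     # max over y >= x
    return c * x_grid[:, None] + best, Q


V = np.zeros((len(x_grid), len(r_grid)))  # terminal condition V_{T+1} = 0
for t in range(T, 0, -1):
    V, Q = backward_step(V)

M = Q.max(axis=1)
s_idx = M.argmax(axis=0)                  # unconstrained maximiser in y for each r
S = x_grid[s_idx]                         # base-stock level per reference price
P = r_grid[Q[s_idx, :, np.arange(len(r_grid))].argmax(axis=1)]  # list price per r
print(S, P)
```

Because the maximisation over $y \ge x$ is implemented as a reverse running maximum, $V(x, r) - c x$ is non-increasing in $x$ by construction, which mirrors lemma 5.9. Whether the resulting policy has base-stock-list-price form on such a coarse grid has to be inspected from `Q` itself.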


By systematically varying the demand parameters $\beta_0$, $\beta_1$, $\beta_2$, the memory parameter $\alpha$, the discount factor $\gamma$, the costs $c$, $b$ and $h$, and the length of the planning horizon $T$ we find strong evidence that a base-stock-list-price policy is optimal under any considered setting. Hence, we only present illustrative examples in sections 6.1 to 6.3.


6.1. Loss-neutral customer behavior 


6.1.1. The optimal policy’s structure 


As a first step we will concentrate on loss-neutral customer behavior ($\beta_2 = \beta_3$), which we investigated analytically in chapter 5. Although there we could only show the optimality of a base-stock-list-price policy for a two-period problem, we now find that this structure also extends to any time period of a finite time horizon with length $T > 2$. Extensive numerical studies give strong evidence that the expected profit $J_t(y, p, r)$ is jointly concave in any time period $1 \le t \le T$ in inventory level after ordering $y$ and selling price $p$ for any reference price $r$ and thus a base-stock-list-price policy is optimal for any time period $t$. Furthermore, we observe that the steady state solutions (5.3.8) and (5.3.9), derived analytically in section 5.3, are already attained within a relatively short planning horizon ($T = 15$ for the base scenario below).


Figures 6.2 to 6.9 on page 82 and following pages give an illustrative example for the following base scenario: Expected loss-neutral demand is given by the parameters $\beta_0 = 100$, $\beta_1 = -20$ and $\beta_2 = -40$, whereas random demand follows a normal distribution with fixed standard deviation $\sigma = 20$. Because of the cost of capital, maintenance, insurance, loss and damage, the per period holding cost rates amount to approximately one percent of the ordering costs: $c = 0.5$ and $h = 0.005$. High service levels are ensured by setting the backlogging cost rates to about the same magnitude as the ordering costs: $b = 0.4$. Moreover, the memory parameter is given by $\alpha = 0.5$, the discount factor is set to $\gamma = 0.8$ and the total length of the planning horizon is given by $T = 15$. We want to mention here that all the results described below hold for any other tested parameter setting and, interestingly, also for non-linear expected demand functions.


Figures 6.2 to 6.5 are examples for time period $t = 10$. Figure 6.2 shows that the expected profit $J_t(y, p, r)$ is jointly concave in inventory $y$ and price $p$. Moreover, it can be seen in figure 6.3 that the optimal expected profit $V_t(x, r)$ is jointly concave in inventory $x$ and reference price $r$ and increasing in both $x$ and $r$. Therefore a base-stock-list-price policy is optimal for any reference price $r$ (see figures 6.4 and 6.5). Thus, for simplicity, the two three-dimensional graphs of the optimal inventory level $y^*(x, r)$ and the optimal price $p^*(x, r)$ can be reduced to two two-dimensional graphs of the base-stock level $S^*(r)$ and list-price $P^*(r)$, which can be seen to be increasing in reference price $r$ (see figures 6.8 and 6.9).


We now take a closer look at the behavior of the optimal decisions $y^*$ and $p^*$ in the state variables $x$ and $r$ over time. In section 2.3.3 we observed that for the model introduced by Federgruen and Heching (1999) the more time there was left in the planning horizon, the higher the inventory level at which price discounts were given (see figure 2.9). This does not necessarily hold for the integrated model. Here, due to negative carry-over effects, price discounts are generally smaller and are given as soon as the inventory level before ordering is higher than the base-stock level (see figure 6.7). Since we give smaller discounts, we have to react earlier in time in order to move the inventory level before ordering below the steady state level and thus reach a possible steady state. The model in section 2.3.3 aimed at reducing base-stocks over time and thus list-prices were increasing over time as a consequence. For a fixed standard deviation of demand, list-prices tended to be constant over time (see figure 2.8). In contrast, the integrated model including reference price effects behaves qualitatively completely differently. Here, similar to figure 3.3, list-prices are decreasing over time in order to benefit from the reference price effects and are increasing in reference price $r$ (see figure 6.9). With decreasing prices over time (see figures 6.7 and 6.9) and the resulting reference price effect, expected demands are increasing, necessitating higher base-stock levels over time (see figures 6.6 and 6.8).


Figure 6.1.: Diverse demand probability density functions


6.1.2. The influence of the demand distribution 


In this subsection we investigate the influence of different demand distributions and coefficients of variation.


Fig. 6.1 illustrates the different shapes of the demand's probability density functions for several demand distributions (truncated normal, log-normal, negative binomial, and beta) with the same mean and variance. Note that the log-normal, negative binomial, and beta distributions have considerably heavier tails than the corresponding truncated normal and uniform distributions, respectively. All three of them are positively skewed (their mode is smaller than the expected value) and allow only for positive demands, so there is no need for truncating negative demands as in the case of the normal distribution.


Figure 6.2.: Expected profit in price and inventory (loss-neutral)

[Axes: reference price $r$; inventory before ordering $x$; expected optimal profit $V(x, r)$]

Figure 6.3.: Optimal expected profit in reference price and inventory (loss-neutral)

Figure 6.5.: Optimal price in reference price and inventory (loss-neutral)

Figure 6.6.: Base-stock in inventory level for different time periods (loss-neutral)

Figure 6.7.: Optimal price in inventory level for different time periods (loss-neutral)

Figure 6.8.: Base-stock in reference-price for different time periods (loss-neutral)

Figure 6.9.: List-price in reference-price for different time periods (loss-neutral)
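Comparing distributions "with the same mean and variance" requires matching the parameters of each family to the two target moments. A minimal sketch of this moment matching for the log-normal and the negative binomial case is given below; the function names and the target values (mean 60, standard deviation 20) are illustrative assumptions, and the beta case (which additionally needs a support interval) is omitted.

```python
import math


def lognormal_params(mean, var):
    """Parameters (mu, s2) of ln D ~ N(mu, s2) such that D has the given mean and variance."""
    s2 = math.log(1.0 + var / mean ** 2)
    mu = math.log(mean) - 0.5 * s2
    return mu, s2


def negative_binomial_params(mean, var):
    """Parameters (n, q) of the number of failures before the n-th success,
    with success probability q; requires var > mean (overdispersion)."""
    if var <= mean:
        raise ValueError("negative binomial requires var > mean")
    q = mean / var
    n = mean ** 2 / (var - mean)          # may be non-integer
    return n, q


target_mean, target_var = 60.0, 20.0 ** 2

mu, s2 = lognormal_params(target_mean, target_var)
ln_mean = math.exp(mu + 0.5 * s2)
ln_var = (math.exp(s2) - 1.0) * math.exp(2.0 * mu + s2)

n, q = negative_binomial_params(target_mean, target_var)
nb_mean = n * (1.0 - q) / q
nb_var = n * (1.0 - q) / q ** 2

print(ln_mean, ln_var, nb_mean, nb_var)
```

Note that the negative binomial family can only match targets whose variance exceeds the mean, so very small coefficients of variation cannot be represented with it.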


In addition to studying the effects of different demand distributions with the same mean and variance, we also probe the influence of a demand variation that changes with expected demand. Instead of considering a fixed variance $\sigma^2$ as in section 6.1.1, we analyze the effect when the stochastic term $\epsilon$ of the demand function (e.g. (5.0.1)) follows the truncated normal distribution function with mean zero and variance $\sigma^2 = c.v. \cdot E[D(p, r, \epsilon)]$, where $c.v.$ denotes the coefficient of variation. Note that for a constant coefficient of variation $c.v.$ and price $p$, higher reference prices $r$ result in a higher expected demand and thus a greater demand variation $\sigma^2$.

Extensive numerical studies show that whatever demand distribution or coefficient of variation is used, again a base-stock-list-price policy is found to be optimal. Thus in the following we only consider the effects on the base-stock and list-price levels. For obtaining figures 6.10 to 6.15, again the base scenario from section 6.1.1 with $\beta_0 = 100$, $\beta_1 = -20$, $\beta_2 = -40$, $\alpha = 0.5$, $c = 0.5$, $h = 0.005$ and $b = 0.4$ is presumed. The effect of discounting prospective cash flows is excluded by setting $\gamma = 1$. Furthermore, a coefficient of variation $c.v. = 1$ is used unless stated otherwise.


We now analyze the last time period of the planning horizon, $t = 15$. Since it is very costly to have unsold inventory on hand after the last time period, the main aim here is to reduce as much as possible the risk of not selling the inventory in stock in the last time period. The higher the degree of system uncertainty, that is, either a high coefficient of variation or a heavy-tailed distribution, the more the retailer aims to decrease the standard deviation of demand. This can be achieved by reducing the mean demand, since then the standard deviation is reduced by the same proportion. As demand is a decreasing function of price, the optimal response to an increase in system uncertainty is to increase prices (see figures 6.11 and 6.15). This in turn results in a decreasing optimal base-stock level (see figures 6.10 and 6.14). Furthermore, note that in contrast to section 5.1, list-prices in the last time period do depend on the demand distribution and variation (see figures 6.11 and 6.15), since now the variation of demand is not fixed but depends on price and reference price.


However, in earlier time periods, the dominating objective is not to clear stock, but to 
optimize long-term profits. In order not to incur expensive backlogging cost, the aim is to 
have sufficient inventory in stock. As we discussed above, it is clear that for a heavy tail 
distribution the risk of high demands is higher than for symmetric distribution functions. 
Thus the optimal policy is to increase the inventory stock level for a higher degree of system 
uncertainty (see figure 6.12), which in turn results in lower optimal prices (see figure 6.13). 
Figure 6.10.: Base-stock in reference price for diverse demand distributions (t = 15)

Figure 6.11.: List-price in reference price for diverse demand distributions (t = 15)


Lisa Gimpl-Heersink - 978-3-631-75380-4 
Downloaded from PubFactory at 01/11/2019 05:41:29AM 
via free access 


[Legend: D ~ truncated normal, D ~ lognormal, D ~ negative binomial, D ~ beta. Horizontal axis: reference price $r$]

Figure 6.12.: Base-stock in reference price for diverse demand distributions (t = 5)

Figure 6.13.: List-price in reference price for diverse demand distributions (t = 5)

Figure 6.14.: Base-stock depending on coefficient of variation (t = 15)

Figure 6.15.: List-price depending on coefficient of variation (t = 15)


6.2. Joint versus sequential optimization


Research such as Whitin (1955) has already shown that the simultaneous determination of 
price and ordering or production quantity can yield substantial revenue increase. The coordi- 
nation of price and production decisions potentially increases profit and thus results in more 
efficient supply chains. In this section we shall explore the size of possible benefits when 
using joint optimization compared to sequential optimization via numerical simulations. In 
subsection 6.2.1 we will give a short review with extensions of the results obtained by Fed- 
ergruen and Heching (1999), while in subsection 6.2.2 we investigate how benefits change, 
when reference price effects are included in the model. 


6.2.1. Classical operations research models 


Consider the following ad-hoc, but not unrealistic, mode of operation in which the marketing 
and production decisions are made in stages. The model is decomposed such that marketing 
seeks to maximize its objective function first and the production decision is made second. 
The reason for the suboptimality of the separated model is that the two parties, marketing 
and production, are considering two different objective functions. Since marketing is taking 
demand fulfillment for granted, its objective reduces to maximizing expected revenues. In 
contrast, production also takes into consideration inventory costs. Thus the optimal produc- 
tion decision always depends on the actual inventory in stock. 


Figure 6.16 depicts the gains of jointly determining an optimal price and inventory level versus the sequential procedure, where marketing first determines the profit-optimal price $p^* = -(\beta_0 - \beta_1 c)/(2\beta_1)$ and then the production unit decides on an optimal stocking quantity without having the option of changing the price. The largest benefits of joint optimization are
obtained towards the end of, or for a short planning horizon. In contrast to the comparisons in 
Federgruen and Heching (1999), who base all numerical results on a coefficient of variation, 
we always use a constant standard deviation in this section (not depending on price). This 
erodes a lot of the benefit when using a dynamic pricing model. Figure 6.17 shows relatively 
low benefits for low stock before ordering, which can be much higher for substantially larger 
inventory levels before ordering. The closer we get to the end of the planning horizon, the 
earlier this effect can be observed. This is intuitive as the seller tries to reduce the risk of 
being left with unsold stock at the end of the planning horizon. 
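The price used by marketing in the sequential procedure can be verified with a short computation. The sketch below assumes that, without reference effects, expected demand is $\beta_0 + \beta_1 p$ and that marketing maximizes the expected margin $(p - c)(\beta_0 + \beta_1 p)$; setting the derivative $\beta_0 + 2\beta_1 p - \beta_1 c$ to zero gives $p^* = -(\beta_0 - \beta_1 c)/(2\beta_1)$. The parameter values and function names are illustrative.

```python
def sequential_price(b0, b1, c):
    """Closed-form maximiser of (p - c) * (b0 + b1 * p) for b1 < 0."""
    return -(b0 - b1 * c) / (2.0 * b1)


def expected_margin(p, b0, b1, c):
    """Expected per-period margin when demand fulfilment is taken for granted."""
    return (p - c) * (b0 + b1 * p)


b0, b1, c = 100.0, -20.0, 0.5
p_seq = sequential_price(b0, b1, c)

# Cross-check against a brute-force search on a price grid with step 0.01.
grid = [1.0 + 0.01 * k for k in range(301)]
p_grid = max(grid, key=lambda p: expected_margin(p, b0, b1, c))
print(p_seq, p_grid)
```

This price ignores holding and backlogging costs and the current inventory level, which is the source of the suboptimality of the sequential procedure discussed above.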


6.2.2. Integrated model with reference price effects 


In this subsection we investigate the differences between a sequential and a joint optimization approach for integrated models including reference price effects, introduced in chapter 4. As in subsection 6.2.1 we will explore the size of possible benefits when using simultaneous compared to decomposed optimization.
Figure 6.16.: Profit increase over time

Figure 6.17.: Profit increase in inventory level $x_0$

[Two-panel diagram. Sequential optimization: marketing first maximizes its discounted objective over prices, which yields $p^*(r) = [p_1^*(r_1), \ldots, p_T^*(r_T)]$ and the resulting reference prices $r^* = [r_1^*, \ldots, r_T^*]$; production then maximizes expected discounted profit over stocking quantities for these prices, which yields $y^*(x) = [y_1^*(x_1), \ldots, y_T^*(x_T)]$. Joint optimization: prices and stocking quantities are maximized simultaneously, which yields $y^*(x, r) = [y_1^*(x_1, r_1), \ldots, y_T^*(x_T, r_T)]$ and $p^*(x, r) = [p_1^*(x_1, r_1), \ldots, p_T^*(x_T, r_T)]$. Each panel ends with its optimal total expected profit, $V^{seq}(x_1, r_1)$ and $V^{joint}(x_1, r_1)$, respectively.]

Figure 6.18.: Sequential optimization of price and inventory vs. joint optimization


Figure 6.18 illustrates that in a sequential approach the marketing/sales department first determines an optimal price without considering any inventory decisions and taking demand fulfillment for granted. The optimal price $p^*(r)$ in the decomposed model therefore depends only on the reference price $r$. This price is then passed on to the production unit of the company, which decides on an optimal stocking quantity without being able to change the price. Here the optimal stocking quantity $y^*(x)$ only depends on the inventory level before ordering $x$. In the joint approach, both decisions are taken simultaneously and thus the optimal price $p^*(x, r)$ and the optimal stocking quantity $y^*(x, r)$ are both functions of inventory $x$ and reference price $r$. Hence, with this better possibility of reacting to the system dynamics, a simultaneous optimization yields profits at least as high as a sequential procedure, although the sequential approach is already highly sophisticated by incorporating non-stationary prices, which vary over time.


To obtain figures 6.19 to 6.24, the parameter set in table 6.1 is used. In figures 6.19 and 6.20, we compare the optimal base-stock and price/reference price paths for the sequential and joint approach from figure 6.18. For the time being, we assume $r_1 = p_\infty^*$ and $x_1 = 0$ to avoid having a transient phase at the beginning of the planning horizon. Furthermore, we assume that the actual demand realization is $D(p_t, r_t, \epsilon_t) = E[D(p_t, r_t, \epsilon_t)]$ in any time period $t$. Using joint optimization, price $p_t$ and base-stock $y_t$ leave their steady states later in time. This is because we have the opposing strategies of benefitting from the reference effects by lowering prices towards the end of the planning horizon (see figure 3.3) and aiming at a clearance of stock at the end of the planning horizon (see figure 2.8).
The question now is how such a joint optimization of price and inventory increases the benefits over the sequential optimization. Figure 6.21 shows that, similar to figure 6.16, we get the largest benefits of joint optimization towards the end of, or for a short planning horizon. In contrast to the comparisons in Federgruen and Heching (1999), who base all numerical results on a coefficient of variation, we always use a constant standard deviation (not depending on price). As in subsection 6.2.1, this generally results in relatively low benefits. Again, higher benefits can be obtained for substantially larger inventory levels before ordering (see figure 6.22 for the last time period $t = 50$). Similar to figure 6.17, this effect can be observed for any time period $t$. However, in comparison to the model without reference effects (compare subsection 6.2.1), for smaller $t$ this effect only appears for inventory levels before ordering much higher than 200. This is because the pricing strategy under reference price effects enables us to clear higher stock levels in later time periods.


Value | Description
      | intercept of the underlying demand model
      | scaling factor of the price effect
      | scaling factor of the reference price effect
0.005 |
0     |


Table 6.1.: Sample parameter set

Figure 6.23 shows that the benefit of the joint model with reference effect is at least 10 times the benefit of the model without reference effect, and is considerably higher when the reference effect increases. While in the classical setting price is only varied to control inventory, here price has its own dynamics, and incorporating the influence of the reference price increases the benefits of integrating pricing and inventory control significantly. Moreover, a significant difference to the model of subsection 6.2.1 is that while there the benefit converges to zero for long planning horizons, here the benefit converges to a value considerably higher than zero (depending on the parameters chosen). This effect is more prominent the more the starting reference price differs from the optimal steady state price (see figure 6.24).
[Axes: time period; price/reference price. Legend: optimal list-price (joint optimization), optimal price (sequential optimization), optimal reference price (joint optimization), optimal reference price (sequential optimization), each with $r_1 = p^* = 2.16$]

Figure 6.19.: Price path (sequential vs. joint optimization)

[Axes: time period; base-stock. Legend: optimal base-stock (joint optimization), optimal base-stock (sequential optimization), each with $r_1 = p^* = 2.16$]

Figure 6.20.: Inventory path (sequential vs. joint optimization)

[Vertical axis: percentage increase in profit]

Figure 6.21.: Expected profit increase over time

[Axes: inventory level before ordering; percentage increase in profit. Legend: time periods t = 25, t = 45, t = 50]

Figure 6.22.: Expected profit increase in inventory level $x_0$

[Horizontal axis: reference effect, with $\beta_2 = \beta_3 = 0$, $-20$, $-40$, $-60$. Time period t = 40 (out of 50)]

Figure 6.23.: Expected profit increase in reference effect

[Time period t = 1 (out of 50)]

Figure 6.24.: Expected profit increase in reference price $r_1$

6.3. Extensions


In this section we will give some extensions that are worth considering, but beyond the 
scope of this thesis. As they could be a subject of further research, we have chosen to give a 
preliminary idea of the results and difficulties here. We are going to address the issue of loss- 
averse and loss-seeking customer behavior and then give a brief description of what happens 
when fixed ordering costs are included in the model. 


For loss-averse customer behavior, where consumers respond more strongly to surcharges than to
discounts ($\beta_2 < \beta_3$), we already found in section 5.1 that a base-stock-list-price policy is
optimal (see figures 6.25 and 6.26 for the last time period $t = 15$). However, we also showed
in section 5.1 that, although the optimal price is increasing in the reference price, this no
longer holds true for the optimal inventory level. We can see in figures 6.27 and 6.28 that
this behavior extends to any other time period. Since the optimal price path is monotonic
over time according to Popescu and Wu (2007) (compare section 3.3.2), the optimal price
converges monotonically to a steady state, which depends on the initial reference price
level (see figure 3.6). It is therefore plausible that the same is true for the
base-stock level.


In the case of loss-seeking customers, where the demand function is steeper for gains
than for losses and consumers stockpile when prices are low ($\beta_2 > \beta_3$), we can see from
the example of figures 6.29 and 6.30 that a base-stock-list-price policy is again optimal. In
contrast to the loss-averse customer behavior discussed above, the base-stock and list-price
levels lose not only their monotonicity in the reference price, but also their continuity. As
already described in section 3.3.2, the jump discontinuity in the optimal price results in a
cycling policy over time, and thus it stands to reason that the optimal stocking quantity will
also cycle over time.
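
The asymmetric response described in the two paragraphs above can be illustrated with a small sketch. It assumes a piecewise-linear expected demand of the form $\beta_0 + \beta_1 p + \beta_2 \max(p - r, 0) + \beta_3 \min(p - r, 0)$ with negative slopes, so that $\beta_2 < \beta_3$ gives the loss-averse and $\beta_2 > \beta_3$ the loss-seeking case, together with the exponential-smoothing update $r' = \alpha r + (1 - \alpha) p$. The function names and all parameter values are hypothetical and chosen for illustration only; the exact parameterization of the demand model may differ.

```python
def expected_demand(p, r, beta0, beta1, beta2, beta3):
    """Kinked linear expected demand (assumed form, for illustration only).

    beta1 < 0 is the direct price effect, beta2 <= 0 weights perceived
    surcharges (p > r) and beta3 <= 0 weights perceived discounts (p < r).
    """
    surcharge = max(p - r, 0.0)   # perceived loss, non-negative
    discount = min(p - r, 0.0)    # perceived gain, non-positive
    return beta0 + beta1 * p + beta2 * surcharge + beta3 * discount


def next_reference_price(r, p, alpha):
    """Exponential smoothing of the reference price with memory parameter alpha."""
    return alpha * r + (1.0 - alpha) * p


if __name__ == "__main__":
    base = dict(beta0=100.0, beta1=-20.0)                 # hypothetical values
    loss_averse = dict(base, beta2=-30.0, beta3=-10.0)    # beta2 < beta3
    loss_seeking = dict(base, beta2=-10.0, beta3=-30.0)   # beta2 > beta3
    r = 2.0
    for name, params in (("loss-averse", loss_averse), ("loss-seeking", loss_seeking)):
        lost = expected_demand(r, r, **params) - expected_demand(r + 0.1, r, **params)
        won = expected_demand(r - 0.1, r, **params) - expected_demand(r, r, **params)
        print(name, "demand lost by a 0.1 surcharge:", round(lost, 2),
              "demand won by a 0.1 discount:", round(won, 2))
    print("updated reference price:", next_reference_price(r=2.0, p=1.5, alpha=0.8))
```

With these illustrative numbers a surcharge costs more demand than an equal discount wins in the loss-averse case, and the reverse holds in the loss-seeking case.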


As a last extension we consider the case where the ordering costs also include a fixed
cost component, for loss-neutral customer behavior. In contrast to section 6.1.1 we find that
here a simple base-stock-list-price policy is not optimal. Chen and Simchi-Levi (2004a)
have already shown for an integrated pricing and inventory control model without reference
prices that in the case of fixed ordering costs an $(s, S, p)$ policy is optimal: If the inventory
level at the beginning of period $t$ is below the reorder point $s_t$, an order is placed to raise
the inventory level to the order-up-to level $S_t$, and a price $P_t$ is charged. Otherwise no
order is placed and a different price $p_t(x)$ is offered, which is decreasing in the inventory
level $x$. Figures 6.33 and 6.34 show that this result extends to the integrated model including
reference price effects. With fixed ordering costs, in contrast to loss-seeking customer
behavior, the jump discontinuity occurs in the inventory level $x$ instead of in the
reference price $r$ (see figures 6.35 and 6.36).
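
The decision rule of an $(s, S, p)$ policy described above can be written down in a few lines. The following is a minimal sketch for a single period and a fixed reference price; the function name, the argument names and the example price curve are hypothetical, and in the model with reference price effects the reorder point, the order-up-to level and both prices would additionally depend on the reference price.

```python
from typing import Callable, Tuple


def s_S_p_decision(
    x: float,
    reorder_point: float,
    order_up_to: float,
    list_price: float,
    no_order_price: Callable[[float], float],
) -> Tuple[float, float]:
    """Return (inventory after ordering, price) for one period.

    If the starting inventory x is below the reorder point s, order up to S
    and charge the price P; otherwise do not order and charge p(x).
    """
    if x < reorder_point:
        return order_up_to, list_price
    return x, no_order_price(x)


if __name__ == "__main__":
    # Hypothetical, decreasing price curve for the no-order region.
    price_curve = lambda x: 2.5 - 0.005 * x
    for start in (20.0, 120.0):
        print(start, s_S_p_decision(start, 50.0, 150.0, 2.2, price_curve))
```

The jump in the inventory dimension mentioned in the text corresponds to the switch between the two branches at the reorder point.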
Figure 6.25.: Optimal inventory in reference price and inventory (loss-averse)
(Axes: optimal inventory $y^*(x, r)$ against inventory before ordering $x$.)


Figure 6.26.: Optimal price in reference price and inventory (loss-averse)


Figure 6.27.: Base-stock in reference price for different time periods (loss-averse)


Figure 6.28.: List-price in reference price for different time periods (loss-averse)


Figure 6.29.: Optimal inventory in reference price and inventory (loss-seeking)
(Vertical axis: optimal inventory $y^*(x, r)$.)


Figure 6.30.: Optimal price in reference price and inventory (loss-seeking)


Figure 6.31.: Base-stock in reference price for different time periods (loss-seeking)


Figure 6.32.: List-price in reference price for different time periods (loss-seeking)


Figure 6.33.: Optimal inventory in reference price and inventory (incl. fixed costs)
(Vertical axis: optimal inventory level $y^*(x, r)$.)


Figure 6.34.: Optimal price in reference price and inventory (incl. fixed costs)


Figure 6.35.: Base-stock in inventory level for different time periods (incl. fixed costs)


Figure 6.36.: Optimal price in inventory level for different time periods (incl. fixed costs)
7. Summary, Conclusion and Future Research


This thesis addressed the problem of simultaneously determining a pricing and inventory
replenishment strategy by combining two literature streams: the operations-oriented stream
and the marketing-oriented stream. In order to benefit from the strengths of both research
areas, we combined the rich cost models commonly used in operations research with the
sophisticated demand models that have so far been applied mainly in marketing. Integrating the
consumers' willingness to pay into pricing and inventory control models increases the
dimension of the state space of the underlying dynamic program, which substantially increases
the model's complexity. Within this context we studied how the additional dynamics affect an
optimal policy and whether a simple policy such as a base-stock-list-price policy still holds
in such a setting.


For the one-period case we analytically proved the optimality of a base-stock-list-price
policy under very general conditions. Furthermore, we showed additional structural
properties in the state space describing the consumers' willingness to pay. However, due to the
added complexity of the model, even an extension to the two-period version of the problem
caused major complications in analytical tractability, since the value function is no longer
"well behaved" and thus commonly used proof techniques could not be applied. With tedious
and extensive mathematical investigations, we proved the optimality of a base-stock-list-price
policy in the two-period setting for the linear and loss-neutral demand function. We
also suggested a way of proving the optimality of a base-stock policy for the multi-period
case, which only holds under very restrictive assumptions. However, we were able to give
useful and explicit steady-state solutions for the multi-period setting, provided that such a
steady state exists. Extensive numerical studies suggest that the optimal solutions converge
relatively quickly in time (for reasonable parameter settings, convergence could even be
observed within fifteen time periods).


Using numerical simulations, we extended the analytical results to more general settings,
such as a longer planning horizon, more general demand functions, or a more
complex cost structure. Moreover, we investigated the potential increase in profit from
simultaneously determining optimal prices and stocking quantities compared to a sequential
optimization, where prices are set first by the marketing department of a company and the
production unit then decides on the optimal stocking quantity without being able to change
prices. We found that the benefits increase considerably when reference price effects are
included in the model. With constant standard deviations of demand, we achieved at least
ten times the benefit attained by a joint model without reference prices, which makes an
integration of pricing and inventory control with reference price effects well worth the
effort.


In this work, first steps were taken toward obtaining analytical structural results. However,
due to the complexity of the model, general proofs could only be given under rather
restrictive assumptions. For further research, it would be worthwhile to investigate whether
the results obtained also hold for more general demand functions and multi-period finite
planning horizons. Furthermore, it would be interesting to see whether the results obtained for
the finite horizon case can also be extended to the infinite horizon case. We suggest that more
work be done for the case of non-zero fixed ordering costs and loss-averse customer
behavior. We also recommend that more efficient algorithms be developed that would be more
applicable in practice, since with a two-dimensional state and action space computation
times quickly become very long.
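A. Auxiliary Calculations


Lemma A.1. The one-period profit functions $\Pi_1^h(x_1, y_1, p_1, r_1, \epsilon_1)$ and $\Pi_1^b(x_1, y_1, p_1, r_1, \epsilon_1)$
are both jointly concave in $y_1$ and $p_1$, and for the one-period profit $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$ we
obtain the following results:

\[ \frac{\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial p_1^2} \le 0, \tag{A.1} \]
\[ \frac{\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial y_1^2} = 0, \tag{A.2} \]
\[ \frac{\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial y_1 \partial p_1} = 0. \tag{A.3} \]


Proof. Equation (5.2.4) yields

\[ \frac{\partial \Pi_1^h(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial p_1} = D(p_1, r_1, \epsilon_1) + p_1 \frac{\partial D(p_1, r_1, \epsilon_1)}{\partial p_1} + h \frac{\partial D(p_1, r_1, \epsilon_1)}{\partial p_1} \]

for $\epsilon_1 < y_1 - E[D(p_1, r_1, \epsilon_1)]$ and

\[ \frac{\partial \Pi_1^b(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial p_1} = D(p_1, r_1, \epsilon_1) + p_1 \frac{\partial D(p_1, r_1, \epsilon_1)}{\partial p_1} - b \frac{\partial D(p_1, r_1, \epsilon_1)}{\partial p_1} \]

for $\epsilon_1 \ge y_1 - E[D(p_1, r_1, \epsilon_1)]$. Furthermore it follows directly that

\[ \frac{\partial^2 \Pi_1^h(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial p_1^2} = 2 \frac{\partial D(p_1, r_1, \epsilon_1)}{\partial p_1} + p_1 \frac{\partial^2 D(p_1, r_1, \epsilon_1)}{\partial p_1^2} + h \frac{\partial^2 D(p_1, r_1, \epsilon_1)}{\partial p_1^2}, \]
\[ \frac{\partial^2 \Pi_1^b(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial p_1^2} = 2 \frac{\partial D(p_1, r_1, \epsilon_1)}{\partial p_1} + p_1 \frac{\partial^2 D(p_1, r_1, \epsilon_1)}{\partial p_1^2} - b \frac{\partial^2 D(p_1, r_1, \epsilon_1)}{\partial p_1^2}. \]

For the linear demand function (5.0.1) it is clear that $\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)/\partial p_1^2 = 2(\beta_1 + \beta_2)$.
The partial first and second derivatives with respect to $y_1$ similarly reduce to:

\[ \frac{\partial \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial y_1} = \begin{cases} -c - h, & \epsilon_1 < y_1 - E[D(p_1, r_1, \epsilon_1)], \\ -c + b, & \epsilon_1 \ge y_1 - E[D(p_1, r_1, \epsilon_1)], \end{cases} \qquad \frac{\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial y_1^2} = 0. \]

Since $\partial \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)/\partial y_1$ is a constant, it is clear that the mixed partial second
derivative reduces to zero:

\[ \frac{\partial^2 \Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)}{\partial y_1 \partial p_1} = 0. \]

Moreover, the determinant of the Hessian matrix is given by

\[ \frac{\partial^2 E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)]}{\partial y_1^2} \cdot \frac{\partial^2 E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)]}{\partial p_1^2} - \left( \frac{\partial^2 E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)]}{\partial p_1 \partial y_1} \right)^2 = 0. \]

Hence, the Hessian matrix is negative semi-definite, which ensures that $E[\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)]$
is jointly concave in $p_1$ and $y_1$. $\square$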


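Lemma A.2. The expected optimal profit of the second time period $V_2^*(r_2(p_1, r_1))$ is jointly
concave in $y_1$ and $p_1$. Furthermore, we obtain the following results for the optimal list
price $P_2^*(r_2(p_1, r_1))$ and the optimal profit $V_2^*(r_2(p_1, r_1))$:

\[ \frac{\partial P_2^*(r_2(p_1, r_1))}{\partial p_1} = \frac{\beta_2 (1 - \alpha)}{2(\beta_1 + \beta_2)}, \tag{A.4} \]
\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial p_1^2} = -\frac{\beta_2^2 (1 - \alpha)^2}{2(\beta_1 + \beta_2)}, \tag{A.5} \]
\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial y_1^2} = 0, \tag{A.6} \]
\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial y_1 \partial p_1} = 0. \tag{A.7} \]


Proof. Differentiating $P_2^*(r_2(p_1, r_1))$ implicitly with respect to $p_1$ yields

\[ \frac{\partial P_2^*(r_2(p_1, r_1))}{\partial p_1} = -\frac{\dfrac{\partial E[D(p_2, r_2, \epsilon_2)]}{\partial r_2} + (p_2 - c) \dfrac{\partial^2 E[D(p_2, r_2, \epsilon_2)]}{\partial p_2 \partial r_2}}{2 \dfrac{\partial E[D(p_2, r_2, \epsilon_2)]}{\partial p_2} + (p_2 - c) \dfrac{\partial^2 E[D(p_2, r_2, \epsilon_2)]}{\partial p_2^2}} \, (1 - \alpha). \]

For the linear demand function (5.0.1) the above result simplifies to

\[ \frac{\partial P_2^*(r_2(p_1, r_1))}{\partial p_1} = -\frac{\partial E[D(p_2, r_2, \epsilon_2)]/\partial r_2}{2 \, \partial E[D(p_2, r_2, \epsilon_2)]/\partial p_2} \, (1 - \alpha) = \frac{\beta_2 (1 - \alpha)}{2(\beta_1 + \beta_2)}. \]

We now use these results to find the partial derivatives of $V_2^*(r_2(p_1, r_1))$:

\[ \frac{\partial V_2^*(r_2(p_1, r_1))}{\partial p_1} = \frac{\partial E[\Pi_2(x_2(y_1, p_1, r_1, \epsilon_1), S_2^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial p_1} \]
\[ = \frac{\partial P_2^*(r_2(p_1, r_1))}{\partial p_1} \, E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] + \left( P_2^*(r_2(p_1, r_1)) - c \right) \frac{\partial E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial p_1}. \]

From above we already know that $\partial P_2^*(r_2(p_1, r_1))/\partial p_1$ reduces to a constant. Furthermore
we obtain

\[ P_2^*(r_2(p_1, r_1)) = \frac{-\beta_0 + c(\beta_1 + \beta_2) + \beta_2 (1 - \alpha) p_1 + \beta_2 \alpha r_1}{2(\beta_1 + \beta_2)}, \]
\[ E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] = \frac{-(1 - \alpha) \beta_2 p_1 - \beta_2 \alpha r_1 + \beta_0 + c(\beta_1 + \beta_2)}{2}, \]
\[ \frac{\partial E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial p_1} = -\frac{(1 - \alpha) \beta_2}{2}. \]

This simplifies the calculation of the second-order derivatives:

\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial p_1^2} = \frac{\partial^2 E[\Pi_2(x_2(y_1, p_1, r_1, \epsilon_1), S_2^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial p_1^2} \]
\[ = 2 \, \frac{\partial P_2^*(r_2(p_1, r_1))}{\partial p_1} \cdot \frac{\partial E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial p_1} = -\frac{\beta_2 (1 - \alpha)}{\beta_1 + \beta_2} \cdot \frac{(1 - \alpha) \beta_2}{2} = -\frac{\beta_2^2 (1 - \alpha)^2}{2(\beta_1 + \beta_2)}, \]
\[ \frac{\partial V_2^*(r_2(p_1, r_1))}{\partial y_1} = \frac{\partial E[\Pi_2(x_2(y_1, p_1, r_1, \epsilon_1), S_2^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial y_1} = 0, \]
\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial y_1^2} = \frac{\partial^2 E[\Pi_2(x_2(y_1, p_1, r_1, \epsilon_1), S_2^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)]}{\partial y_1^2} = 0, \]
\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial y_1 \partial p_1} = 0. \]

Hence, the determinant of the Hessian matrix is given by

\[ \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial y_1^2} \cdot \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial p_1^2} - \left( \frac{\partial^2 V_2^*(r_2(p_1, r_1))}{\partial p_1 \partial y_1} \right)^2 = 0. \]

Since by the above equation the Hessian matrix is positive semi-definite, $V_2^*(r_2(p_1, r_1))$ is
jointly concave in $y_1$ and $p_1$.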
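
The closed-form list price used in the proof of Lemma A.2 can be cross-checked numerically. The sketch below assumes the linear, loss-neutral expected demand $\beta_0 + \beta_1 p + \beta_2 (p - r)$ with negative $\beta_1, \beta_2$, and it assumes that the second-period expected profit depends on the price only through $(p - c) \cdot E[D]$, with the inventory-related cost term constant in the price. All function names and parameter values are hypothetical.

```python
def expected_demand(p, r, beta0, beta1, beta2):
    """Linear, loss-neutral expected demand; beta1 and beta2 are negative."""
    return beta0 + beta1 * p + beta2 * (p - r)


def list_price(r2, c, beta0, beta1, beta2):
    """Closed-form maximizer of (p - c) * expected_demand(p, r2, ...)."""
    return (-beta0 + c * (beta1 + beta2) + beta2 * r2) / (2.0 * (beta1 + beta2))


def grid_argmax(r2, c, beta0, beta1, beta2, lo=0.0, hi=10.0, steps=100_000):
    """Brute-force maximizer on a price grid, used only as a cross-check."""
    best_p, best_val = lo, float("-inf")
    for i in range(steps + 1):
        p = lo + (hi - lo) * i / steps
        val = (p - c) * expected_demand(p, r2, beta0, beta1, beta2)
        if val > best_val:
            best_p, best_val = p, val
    return best_p


if __name__ == "__main__":
    beta0, beta1, beta2 = 100.0, -20.0, -10.0   # hypothetical values
    c, alpha, r1 = 0.5, 0.8, 2.0
    prices = []
    for p1 in (1.8, 2.0):
        r2 = alpha * r1 + (1.0 - alpha) * p1     # reference price update
        prices.append(list_price(r2, c, beta0, beta1, beta2))
        print("p1 =", p1, "closed form:", prices[-1],
              "grid search:", grid_argmax(r2, c, beta0, beta1, beta2))
    # Finite-difference slope in p1 against beta2 * (1 - alpha) / (2 * (beta1 + beta2)).
    print((prices[1] - prices[0]) / 0.2,
          beta2 * (1.0 - alpha) / (2.0 * (beta1 + beta2)))
```

Because the list price is affine in $p_1$ under these assumptions, the finite-difference slope and the constant derivative agree up to rounding.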


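Lemma A.3. Using the short notation $x_2 = x_2(y_1, p_1, r_1, \epsilon_1)$, $r_2 = r_2(p_1, r_1)$ and
$E[D]^* = E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)]$ we obtain the following results for the optimal price
$p_2^*(x_2, r_2)$ and the optimal profit to go $\vartheta_2^*(x_2, r_2)$:

\[ \frac{\partial p_2^*(x_2, r_2)}{\partial p_1} = \frac{(1 - \alpha) \beta_2 / f(x_2 - E[D]^*) + (h + b)(\beta_1 + \beta_2)(\beta_1 + \alpha \beta_2)}{(\beta_1 + \beta_2) \left( 2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2) \right)}, \tag{A.8} \]
\[ \frac{\partial p_2^*(x_2, r_2)}{\partial y_1} = \frac{-(h + b)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}, \tag{A.9} \]
\[ \frac{\partial^2 \vartheta_2^*(x_2, r_2)}{\partial p_1^2} = -\frac{2(h + b)(\beta_1 + \beta_2)^2 (\beta_1 + \alpha \beta_2) + (1 - \alpha)^2 \beta_2^2 / f(x_2 - E[D]^*)}{(\beta_1 + \beta_2) \left( 2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2) \right)}, \tag{A.10} \]
\[ \frac{\partial^2 \vartheta_2^*(x_2, r_2)}{\partial y_1^2} = -\frac{2(h + b)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}, \tag{A.11} \]
\[ \frac{\partial^2 \vartheta_2^*(x_2, r_2)}{\partial y_1 \partial p_1} = \frac{(h + b)(2 \beta_1 + (1 + \alpha) \beta_2)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}. \tag{A.12} \]


Proof. By differentiating $p_2^*(x_2, r_2)$ implicitly with respect to $x_2$ we obtain for the linear
demand (5.0.1)

\[ \frac{\partial p_2^*(x_2, r_2)}{\partial x_2} = \frac{-(h + b) E[D_{p_2}(p_2^*, r_2, \epsilon_2)] f(x_2 - E[D]^*)}{2 E[D_{p_2}(p_2^*, r_2, \epsilon_2)] - (h + b) \left( E[D_{p_2}(p_2^*, r_2, \epsilon_2)] \right)^2 f(x_2 - E[D]^*)} = \frac{-(h + b)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}. \]

Likewise we obtain

\[ \frac{\partial p_2^*(x_2, r_2)}{\partial r_2} = \frac{-E[D_{r_2}(p_2^*, r_2, \epsilon_2)] \left( 1 - (h + b) E[D_{p_2}(p_2^*, r_2, \epsilon_2)] f(x_2 - E[D]^*) \right)}{2 E[D_{p_2}(p_2^*, r_2, \epsilon_2)] - (h + b) \left( E[D_{p_2}(p_2^*, r_2, \epsilon_2)] \right)^2 f(x_2 - E[D]^*)} \]
\[ = \frac{\beta_2 / f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2) \beta_2}{(\beta_1 + \beta_2) \left[ 2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2) \right]} \ge 0. \]

We now differentiate $p_2^*(x_2, r_2)$ with respect to the price of the first time period $p_1$ and the
inventory level of the first period $y_1$:

\[ \frac{\partial p_2^*(x_2, r_2)}{\partial p_1} = \frac{\partial p_2^*(x_2, r_2)}{\partial x_2} \frac{\partial x_2}{\partial p_1} + \frac{\partial p_2^*(x_2, r_2)}{\partial r_2} \frac{\partial r_2}{\partial p_1} = -\frac{\partial p_2^*(x_2, r_2)}{\partial x_2} (\beta_1 + \beta_2) + \frac{\partial p_2^*(x_2, r_2)}{\partial r_2} (1 - \alpha) \]
\[ = \frac{(1 - \alpha) \beta_2 / f(x_2 - E[D]^*) + (h + b)(\beta_1 + \beta_2)(\beta_1 + \alpha \beta_2)}{(\beta_1 + \beta_2) \left[ 2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2) \right]}, \]
\[ \frac{\partial p_2^*(x_2, r_2)}{\partial y_1} = \frac{\partial p_2^*(x_2, r_2)}{\partial x_2} = \frac{-(h + b)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}. \]

By substituting $h$ by $h - \gamma c$ the optimal expected profit of the second time period is given
by equation (5.2.3):

\[ \vartheta_2^*(x_2, r_2) = E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]. \]

Equation (5.1.1) gives:

\[ E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)] = p_2^*(x_2, r_2) E[D]^* - h \int_{-\infty}^{x_2 - E[D]^*} (x_2 - E[D]^* - u) f(u) \, du - b \int_{x_2 - E[D]^*}^{\infty} (E[D]^* - x_2 + u) f(u) \, du. \]

In order to differentiate $E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]$ with respect to $p_1$ we need some
further technical results:

\[ \frac{\partial E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial p_1} = \frac{\partial E[D(p_2^*, r_2, \epsilon_2)]}{\partial p_2} \frac{\partial p_2^*(x_2, r_2)}{\partial p_1} + \frac{\partial E[D(p_2^*, r_2, \epsilon_2)]}{\partial r_2} \frac{\partial r_2}{\partial p_1} = -\frac{(1 - \alpha) \beta_2 / f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)^2}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}, \]
\[ \frac{\partial E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial y_1} = \frac{\partial E[D(p_2^*, r_2, \epsilon_2)]}{\partial p_2} \frac{\partial p_2^*(x_2, r_2)}{\partial y_1} = \frac{-(h + b)(\beta_1 + \beta_2)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}. \]

Accordingly, one can easily calculate

\[ \frac{\partial (x_2 - E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)])}{\partial p_1} = -\frac{(2 \beta_1 + \beta_2 (1 + \alpha)) / f(x_2 - E[D]^*)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)} \ge 0, \]
\[ \frac{\partial (x_2 - E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)])}{\partial y_1} = \frac{2/f(x_2 - E[D]^*)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)} \ge 0. \]

Differentiating $E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]$ gives the derivatives of $\vartheta_2^*(x_2, r_2)$:

\[ \frac{\partial E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial p_1} = \frac{\partial p_2^*(x_2, r_2)}{\partial p_1} E[D]^* + p_2^*(x_2, r_2) \frac{\partial E[D]^*}{\partial p_1} \]
\[ - (h - \gamma c) \frac{\partial (x_2 - E[D]^*)}{\partial p_1} F(x_2 - E[D]^*) + (b + \gamma c) \frac{\partial (x_2 - E[D]^*)}{\partial p_1} \left( 1 - F(x_2 - E[D]^*) \right). \]

Since from above we know that $\partial E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)]/\partial p_1$ reduces to a constant, it is
easy to calculate the second-order derivative:

\[ \frac{\partial^2 E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial p_1^2} = 2 \frac{\partial p_2^*(x_2, r_2)}{\partial p_1} \frac{\partial E[D(p_2^*, r_2, \epsilon_2)]}{\partial p_1} - (h + b) \left( \frac{\partial (x_2 - E[D]^*)}{\partial p_1} \right)^2 f(x_2 - E[D]^*) \]
\[ = -\frac{2(h + b)(\beta_1 + \beta_2)^2 (\beta_1 + \alpha \beta_2) + (1 - \alpha)^2 \beta_2^2 / f(x_2 - E[D]^*)}{(\beta_1 + \beta_2) \left( 2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2) \right)}. \]

Furthermore, differentiating $\vartheta_2^*(x_2, r_2)$ with respect to $y_1$ yields

\[ \frac{\partial E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial y_1} = \frac{\partial p_2^*(x_2, r_2)}{\partial y_1} E[D]^* + p_2^*(x_2, r_2) \frac{\partial E[D]^*}{\partial y_1} \]
\[ - (h - \gamma c) \frac{\partial (x_2 - E[D]^*)}{\partial y_1} F(x_2 - E[D]^*) + (b + \gamma c) \frac{\partial (x_2 - E[D]^*)}{\partial y_1} \left( 1 - F(x_2 - E[D]^*) \right). \]

Since $\partial E[D(p_2^*(x_2, r_2), r_2, \epsilon_2)]/\partial y_1$ also reduces to a constant, again it is easy to calculate
the second-order derivative:

\[ \frac{\partial^2 E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial y_1^2} = 2 \frac{\partial p_2^*(x_2, r_2)}{\partial y_1} \frac{\partial E[D]^*}{\partial y_1} - (h + b) \left( \frac{\partial (x_2 - E[D]^*)}{\partial y_1} \right)^2 f(x_2 - E[D]^*) = -\frac{2(h + b)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}. \]

For the mixed second-order derivatives of $\vartheta_2^*(x_2, r_2)$ we obtain

\[ \frac{\partial^2 E[\Pi(x_2, x_2, p_2^*(x_2, r_2), r_2, \epsilon_2)]}{\partial p_1 \partial y_1} = \frac{\partial p_2^*(x_2, r_2)}{\partial p_1} \frac{\partial E[D(p_2^*, r_2, \epsilon_2)]}{\partial y_1} + \frac{\partial p_2^*(x_2, r_2)}{\partial y_1} \frac{\partial E[D(p_2^*, r_2, \epsilon_2)]}{\partial p_1} \]
\[ - (h + b) \frac{\partial (x_2 - E[D]^*)}{\partial p_1} \frac{\partial (x_2 - E[D]^*)}{\partial y_1} f(x_2 - E[D]^*) = \frac{(h + b)(2 \beta_1 + (1 + \alpha) \beta_2)}{2/f(x_2 - E[D]^*) - (h + b)(\beta_1 + \beta_2)}. \]
$\square$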
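
The sensitivity $\partial p_2^*/\partial x_2 = -(h + b)/(2/f - (h + b)(\beta_1 + \beta_2))$ from the proof of Lemma A.3 can be checked numerically in a simple special case. The sketch below is only an illustration under additional assumptions that are not part of the lemma: demand noise uniform on $[-w, w]$ (so that $f = 1/(2w)$ is constant), no discounting and no ordering in the period considered. All names and parameter values are hypothetical.

```python
def expected_profit(p, x, r, beta0, beta1, beta2, h, b, w):
    """Expected one-period profit without ordering, noise uniform on [-w, w]."""
    mean_demand = beta0 + beta1 * p + beta2 * (p - r)
    z = x - mean_demand                       # stock in excess of mean demand
    if z <= -w:                               # always short
        holding, backorder = 0.0, -z
    elif z >= w:                              # never short
        holding, backorder = z, 0.0
    else:                                     # closed forms for the uniform density
        holding = (z + w) ** 2 / (4.0 * w)
        backorder = (w - z) ** 2 / (4.0 * w)
    return p * mean_demand - h * holding - b * backorder


def optimal_price(x, r, params, lo=0.0, hi=10.0, iters=200):
    """Ternary search for the maximizing price; the profit is concave in p."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if expected_profit(m1, x, r, **params) < expected_profit(m2, x, r, **params):
            lo = m1
        else:
            hi = m2
    return 0.5 * (lo + hi)


def price_slope_formula(params):
    """-(h + b) / (2 / f - (h + b) * (beta1 + beta2)) with f = 1 / (2 w)."""
    f = 1.0 / (2.0 * params["w"])
    k = params["h"] + params["b"]
    return -k / (2.0 / f - k * (params["beta1"] + params["beta2"]))


if __name__ == "__main__":
    params = dict(beta0=100.0, beta1=-20.0, beta2=-10.0, h=1.0, b=3.0, w=30.0)
    r = 2.0
    numeric = (optimal_price(75.0, r, params) - optimal_price(65.0, r, params)) / 10.0
    print("finite difference:", numeric, "formula:", price_slope_formula(params))
```

For these values the excess stock stays inside the support of the noise at both inventory levels, so the optimal price is affine in the inventory level and the finite difference matches the formula up to the accuracy of the search.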


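Lemma A.4. Let $x_2(y_1, p_1, r_1, \epsilon_1) = S_2^*(r_2) - \delta$, with a constant $\delta$. Then the list-price
equals the optimal discounted price for any realization of $y_1$, $p_1$, $r_1$ and $\epsilon_1$:

\[ P_2^*(r_2(p_1, r_1)) = p_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1)). \tag{A.13} \]


Proof. In the case of $\delta > 0$, using the above assumptions, the profit of the second time
period reduces to

\[ E[\Pi(x_2(y_1, p_1, r_1, \epsilon_1), S_2^*(r_2(p_1, r_1)), p_2, r_2, \epsilon_2)] \]
\[ = p_2 E[D_2(p_2, r_2, \epsilon_2)] - c(S_2^* - x_2) - (h - \gamma c) \int_{-\infty}^{y^0} (y^0 - u) f(u) \, du - (b + \gamma c) \int_{y^0}^{\infty} (u - y^0) f(u) \, du \]
\[ = p_2 E[D_2(p_2, r_2, \epsilon_2)] - c \delta - (h - \gamma c) \int_{-\infty}^{y^0} (y^0 - u) f(u) \, du - (b + \gamma c) \int_{y^0}^{\infty} (u - y^0) f(u) \, du. \]

In case of $\delta \le 0$

\[ E[\Pi(x_2, x_2, p_2, r_2, \epsilon_2)] = p_2 E[D_2(p_2, r_2, \epsilon_2)] - (h - \gamma c) \int_{-\infty}^{y^0 - \delta} (y^0 - \delta - u) f(u) \, du - (b + \gamma c) \int_{y^0 - \delta}^{\infty} (u - y^0 + \delta) f(u) \, du. \]

Differentiation yields

\[ \frac{\partial E[\Pi(x_2, S_2^*(r_2(p_1, r_1)), p_2, r_2, \epsilon_2)]}{\partial p_2} = \frac{\partial E[\Pi(x_2, x_2, p_2, r_2, \epsilon_2)]}{\partial p_2} = E[D_2(p_2, r_2, \epsilon_2)] + (p_2 - c) \frac{\partial E[D_2(p_2, r_2, \epsilon_2)]}{\partial p_2}, \]

which gives the optimal value for $P_2^*(r_2(p_1, r_1))$ and $p_2^*(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$ by
setting the above equation equal to zero and substituting $r_2 = \alpha r_1 + (1 - \alpha) p_1$:

\[ p_2^*(x_2, r_2(p_1, r_1)) = P_2^*(r_2(p_1, r_1)) = \frac{-\beta_0 + c(\beta_1 + \beta_2) + \beta_2 (1 - \alpha) p_1 + \beta_2 \alpha r_1}{2(\beta_1 + \beta_2)}. \]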


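Lemma A.5. For the linear demand function (5.0.1), $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$ and a
constant $\delta > 0$ the following holds for the profit of the first period $\Pi_1(x_1, y_1, p_1, r_1, \epsilon_1)$:

\[ \lim_{\delta \to 0} \left[ \Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1) - \Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1) \right] = 0, \tag{A.14} \]
\[ \lim_{\delta \to 0} \left[ \Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1) - \Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1) \right] = 0, \tag{A.15} \]
\[ \lim_{\delta \to 0} \left[ \frac{\partial \Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1)}{\partial p_1} - \frac{\partial \Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1)}{\partial p_1} \right] = -(\beta_1 + \beta_2)(h + b) > 0, \tag{A.16} \]
\[ \lim_{\delta \to 0} \left[ \frac{\partial \Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1)}{\partial y_1} - \frac{\partial \Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1)}{\partial y_1} \right] = (h + b) > 0. \tag{A.17} \]


Proof. Let $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$, $y_1$ be fixed and $\delta > 0$ a constant. Note that for
linear demand $E[D(p_1 + \delta, r_1, \epsilon_1)] = E[D(p_1, r_1, \epsilon_1)] + (\beta_1 + \beta_2) \delta$.

Substituting $p_1$ by $p_1 - \delta$ lets us approach the kink at $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$ from the
left side in $p_1$. Hence the inventory before ordering in the second time period $x_2$ becomes

\[ x_2 = y_1 - E[D(p_1 - \delta, r_1, \epsilon_1)] - \epsilon_1 \]
\[ = y_1 - E[D(p_1 - \delta, r_1, \epsilon_1)] - y_1 + E[D(p_1, r_1, \epsilon_1)] \]
\[ = y_1 - E[D(p_1, r_1, \epsilon_1)] + (\beta_1 + \beta_2) \delta - y_1 + E[D(p_1, r_1, \epsilon_1)] \]
\[ = (\beta_1 + \beta_2) \delta < 0. \]

Since $x_2 < 0$ it is clear that $\Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1) = \Pi_1^b(x_1, y_1, p_1 - \delta, r_1, \epsilon_1)$.
Substituting $p_1$ by $p_1 + \delta$ lets us approach the kink at $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$ from the
right side in $p_1$. Hence, the inventory before ordering becomes $x_2 = -(\beta_1 + \beta_2) \delta > 0$,
which gives $\Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1) = \Pi_1^h(x_1, y_1, p_1 + \delta, r_1, \epsilon_1)$. We are now ready to
calculate

\[ \lim_{\delta \to 0} \left[ \Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1) - \Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1) \right] = \lim_{\delta \to 0} \left[ \Pi_1^b(x_1, y_1, p_1 - \delta, r_1, \epsilon_1) - \Pi_1^h(x_1, y_1, p_1 + \delta, r_1, \epsilon_1) \right], \]

which gives

\[ \lim_{\delta \to 0} \big[ (p_1 - \delta)(E[D(p_1, r_1, \epsilon_1)] - (\beta_1 + \beta_2) \delta + \epsilon_1) - c(y_1 - x_1) - b(E[D(p_1, r_1, \epsilon_1)] - (\beta_1 + \beta_2) \delta + \epsilon_1 - y_1) \]
\[ - (p_1 + \delta)(E[D(p_1, r_1, \epsilon_1)] + (\beta_1 + \beta_2) \delta + \epsilon_1) + c(y_1 - x_1) + h(y_1 - E[D(p_1, r_1, \epsilon_1)] - (\beta_1 + \beta_2) \delta - \epsilon_1) \big]. \]

By substituting $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$ we obtain

\[ \lim_{\delta \to 0} \left[ \Pi_1^b(x_1, y_1, p_1 - \delta, r_1, \epsilon_1) - \Pi_1^h(x_1, y_1, p_1 + \delta, r_1, \epsilon_1) \right] = \lim_{\delta \to 0} \left[ -2 y_1 \delta - 2(\beta_1 + \beta_2) p_1 \delta + (b - h)(\beta_1 + \beta_2) \delta \right] = 0, \]

which proves equation (A.14).

Furthermore we obtain

\[ \lim_{\delta \to 0} \left[ \frac{\partial \Pi_1(x_1, y_1, p_1 - \delta, r_1, \epsilon_1)}{\partial p_1} - \frac{\partial \Pi_1(x_1, y_1, p_1 + \delta, r_1, \epsilon_1)}{\partial p_1} \right] = \lim_{\delta \to 0} \left[ \frac{\partial \Pi_1^b(x_1, y_1, p_1 - \delta, r_1, \epsilon_1)}{\partial p_1} - \frac{\partial \Pi_1^h(x_1, y_1, p_1 + \delta, r_1, \epsilon_1)}{\partial p_1} \right], \]

which results in

\[ \lim_{\delta \to 0} \left[ E[D(p_1 - \delta, r_1, \epsilon_1)] + \epsilon_1 + (\beta_1 + \beta_2)(p_1 - \delta - b) \right] - \lim_{\delta \to 0} \left[ E[D(p_1 + \delta, r_1, \epsilon_1)] + \epsilon_1 + (\beta_1 + \beta_2)(p_1 + \delta + h) \right] \]
\[ = \lim_{\delta \to 0} \left[ E[D(p_1, r_1, \epsilon_1)] - (\beta_1 + \beta_2) \delta - (\beta_1 + \beta_2)(\delta + b) - E[D(p_1, r_1, \epsilon_1)] - (\beta_1 + \beta_2) \delta - (\beta_1 + \beta_2)(\delta + h) \right] \]
\[ = \lim_{\delta \to 0} \left[ -4(\beta_1 + \beta_2) \delta - (\beta_1 + \beta_2)(h + b) \right] = -(\beta_1 + \beta_2)(h + b) > 0, \]

ensuring concavity in $p_1$ and thus proving (A.16).

Now replace $y_1$ by $y_1 - \delta$ with $\delta > 0$ in order to approach the kink at $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$
from the left side in $y_1$. Hence $x_2$ becomes

\[ x_2 = y_1 - \delta - E[D(p_1, r_1, \epsilon_1)] - \epsilon_1 = y_1 - \delta - E[D(p_1, r_1, \epsilon_1)] - y_1 + E[D(p_1, r_1, \epsilon_1)] = -\delta < 0. \]

Since $x_2 < 0$ we have $\Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1) = \Pi_1^b(x_1, y_1 - \delta, p_1, r_1, \epsilon_1)$.

Now replace $y_1$ by $y_1 + \delta$. Then similarly to above we obtain $x_2 = \delta > 0$, which hence gives
$\Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1) = \Pi_1^h(x_1, y_1 + \delta, p_1, r_1, \epsilon_1)$. We are now ready to calculate

\[ \lim_{\delta \to 0} \left[ \Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1) - \Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1) \right] = \lim_{\delta \to 0} \left[ \Pi_1^b(x_1, y_1 - \delta, p_1, r_1, \epsilon_1) - \Pi_1^h(x_1, y_1 + \delta, p_1, r_1, \epsilon_1) \right], \]

which gives

\[ \lim_{\delta \to 0} \big[ p_1 (E[D(p_1, r_1, \epsilon_1)] + \epsilon_1) - c(y_1 - \delta - x_1) - b(E[D(p_1, r_1, \epsilon_1)] + \delta + \epsilon_1 - y_1) \]
\[ - p_1 (E[D(p_1, r_1, \epsilon_1)] + \epsilon_1) + c(y_1 + \delta - x_1) + h(y_1 + \delta - E[D(p_1, r_1, \epsilon_1)] - \epsilon_1) \big]. \]

Substituting $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)]$ gives

\[ \lim_{\delta \to 0} \left[ \Pi_1^b(x_1, y_1 - \delta, p_1, r_1, \epsilon_1) - \Pi_1^h(x_1, y_1 + \delta, p_1, r_1, \epsilon_1) \right] = \lim_{\delta \to 0} \left[ (2c + h - b) \delta \right] = 0, \]

which results in equation (A.15).
Furthermore, we obtain

\[ \lim_{\delta \to 0} \left[ \frac{\partial \Pi_1(x_1, y_1 - \delta, p_1, r_1, \epsilon_1)}{\partial y_1} - \frac{\partial \Pi_1(x_1, y_1 + \delta, p_1, r_1, \epsilon_1)}{\partial y_1} \right] = (h + b) > 0, \]

which ensures concavity in $y_1$ and thus proves (A.17).
$\square$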


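Lemma A.6. For the linear demand function (5.0.1), a constant $\delta > 0$ and $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)] - S^*(r_2(p_1, r_1))$
the following holds for the optimal expected profit of the
second time period $v_2(x_2(y_1, p_1, r_1, \epsilon_1), r_2(p_1, r_1))$:

\[ \lim_{\delta \to 0} \left[ v_2(x_2(y_1, p_1 - \delta, r_1, \epsilon_1), r_2(p_1 - \delta, r_1)) - v_2(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) \right] = 0, \tag{A.18} \]
\[ \lim_{\delta \to 0} \left[ v_2(x_2(y_1 - \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) - v_2(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) \right] = 0, \tag{A.19} \]
\[ \lim_{\delta \to 0} \left[ \frac{\partial v_2(x_2(y_1, p_1 - \delta, r_1, \epsilon_1), r_2(p_1 - \delta, r_1))}{\partial p_1} - \frac{\partial v_2(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1))}{\partial p_1} \right] = 0, \tag{A.20} \]
\[ \lim_{\delta \to 0} \left[ \frac{\partial v_2(x_2(y_1 - \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1))}{\partial y_1} - \frac{\partial v_2(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1))}{\partial y_1} \right] = 0. \tag{A.21} \]


Proof. Let $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)] - S^*(r_2(p_1, r_1))$, $y_1$ be fixed and $\delta > 0$ a
constant. Substituting $p_1$ by $p_1 - \delta$ lets us approach the kink at $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)] - S^*(r_2(p_1, r_1))$
from the left side in $p_1$. As in lemma A.5, the inventory before ordering in
the second time period $x_2(y_1, p_1 - \delta, r_1, \epsilon_1)$ becomes

\[ x_2 = y_1 - E[D(p_1 - \delta, r_1, \epsilon_1)] - \epsilon_1 \]
\[ = y_1 - E[D(p_1, r_1, \epsilon_1)] + (\beta_1 + \beta_2) \delta - y_1 + E[D(p_1, r_1, \epsilon_1)] + S^*(r_2(p_1, r_1)) \]
\[ = S^*(r_2(p_1, r_1)) + (\beta_1 + \beta_2) \delta. \]

Since $(\beta_1 + \beta_2) \delta \le 0$, we have $x_2(y_1, p_1 - \delta, r_1, \epsilon_1) \le S^*(r_2(p_1, r_1))$ and it is clear that
$v_2(x_2(y_1, p_1 - \delta, r_1, \epsilon_1), r_2(p_1 - \delta, r_1)) = V_2^*(r_2(p_1 - \delta, r_1))$.

Substituting $p_1$ by $p_1 + \delta$ lets us approach the kink at $\epsilon_1 = y_1 - E[D(p_1, r_1, \epsilon_1)] - S^*(r_2(p_1, r_1))$
from the right side in $p_1$. Similarly to above $x_2(y_1, p_1 + \delta, r_1, \epsilon_1)$ becomes

\[ x_2 = S^*(r_2(p_1, r_1)) - (\beta_1 + \beta_2) \delta \ge S^*(r_2(p_1, r_1)) \]

and thus

\[ v_2(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) = \vartheta_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)). \]

We are now ready to prove equation (A.18), which by the above results becomes

\[ \lim_{\delta \to 0} \left[ V_2^*(r_2(p_1 - \delta, r_1)) - \vartheta_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) \right]. \]

Furthermore, we can see by Lemma A.4 that

\[ P_2^*(r_2(p_1 - \delta, r_1)) = P_2^*(r_2(p_1, r_1)) - \frac{(1 - \alpha) \beta_2 \delta}{2(\beta_1 + \beta_2)}, \]
\[ p_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) = P_2^*(r_2(p_1, r_1)) + \frac{(1 - \alpha) \beta_2 \delta}{2(\beta_1 + \beta_2)}, \]

which yields the useful properties that

\[ E[D(P_2^*(r_2(p_1 - \delta, r_1)), r_2(p_1 - \delta, r_1), \epsilon_2)] = E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] + O(\delta), \]
\[ E[D(p_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)), r_2(p_1 + \delta, r_1), \epsilon_2)] = E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] + \tilde{O}(\delta), \]

where $O(\delta)$ and $\tilde{O}(\delta)$ are two functions of $\delta$ with $\lim_{\delta \to 0} O(\delta) = 0$ and $\lim_{\delta \to 0} \tilde{O}(\delta) = 0$.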
Using equations (5.1.1) and (5.1.16), we by letting y? — | (scc i) we obtain 


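\[ S^*(r_2(p_1 - \delta, r_1)) - E[D(P_2^*(r_2(p_1 - \delta, r_1)), r_2(p_1 - \delta, r_1), \epsilon_2)] \]
\[ = y^0 + E[D(P_2^*(r_2(p_1 - \delta, r_1)), r_2(p_1 - \delta, r_1), \epsilon_2)] - E[D(P_2^*(r_2(p_1 - \delta, r_1)), r_2(p_1 - \delta, r_1), \epsilon_2)] = y^0. \]

Moreover, note that for linear demand (5.0.1) $E[D(p - \delta, r, \epsilon)] = E[D(p, r, \epsilon)] - \delta(\beta_1 + \beta_2)$,
which gives

\[ V_2^*(r_2(p_1 - \delta, r_1)) = E[\Pi(S^*(r_2(p_1, r_1)) + (\beta_1 + \beta_2) \delta, S^*(r_2(p_1 - \delta, r_1)), P_2^*(r_2(p_1 - \delta, r_1)), r_2(p_1 - \delta, r_1), \epsilon_2)] \]
\[ = P_2^*(r_2(p_1 - \delta, r_1)) E[D(P_2^*(r_2(p_1 - \delta, r_1)), r_2(p_1 - \delta, r_1), \epsilon_2)] - 2c(\beta_1 + \beta_2) \delta - G(S^*(r_2(p_1 - \delta, r_1)), P_2^*(r_2(p_1 - \delta, r_1)), r_2) \]
\[ = P_2^*(r_2(p_1, r_1)) E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] + O(\delta) P_2^*(r_2(p_1, r_1)) - 2c(\beta_1 + \beta_2) \delta - G(S^*(r_2(p_1 - \delta, r_1)), P_2^*(r_2(p_1 - \delta, r_1)), r_2) \]

and for $x_2^{p_1 + \delta} = x_2(y_1, p_1 + \delta, r_1, \epsilon_1)$ and $r_2^{p_1 + \delta} = r_2(p_1 + \delta, r_1)$

\[ \vartheta_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) = E[\Pi(x_2^{p_1 + \delta}, x_2^{p_1 + \delta}, p_2^*(x_2^{p_1 + \delta}, r_2^{p_1 + \delta}), r_2^{p_1 + \delta}, \epsilon_2)] \]
\[ = p_2^*(x_2^{p_1 + \delta}, r_2^{p_1 + \delta}) E[D(p_2^*(x_2^{p_1 + \delta}, r_2^{p_1 + \delta}), r_2^{p_1 + \delta}, \epsilon_2)] - G(S^*(r_2(p_1, r_1)) - (\beta_1 + \beta_2) \delta, p_2^*(x_2^{p_1 + \delta}, r_2^{p_1 + \delta}), r_2^{p_1 + \delta}) \]
\[ = P_2^*(r_2(p_1, r_1)) E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] + \tilde{O}(\delta) P_2^*(r_2(p_1, r_1)) - G(S^*(r_2(p_1, r_1)) - (\beta_1 + \beta_2) \delta, P_2^*(r_2(p_1 + \delta, r_1)), r_2), \]

since $p_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) = P_2^*(r_2(p_1 + \delta, r_1))$ by lemma A.4.
From the above two equations and by the fact that $G(y, p, r)$ is a continuous function and
$\lim_{\delta \to 0} O(\delta) = 0$, $\lim_{\delta \to 0} \tilde{O}(\delta) = 0$, it is easy to see that

\[ \lim_{\delta \to 0} \left[ V_2^*(r_2(p_1 - \delta, r_1)) - \vartheta_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1)) \right] = 0, \]

which proves equation (A.18). From equation (5.1.11) it is easy to see that for $P_2^* = P_2^*(r_2(p_1, r_1))$
the following holds:

\[ \frac{\partial G(S^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2)}{\partial p_1} = (\beta_1 + \beta_2) \left( b - (h + b) F(S^*(r_2(p_1, r_1)) - E[D(P_2^*, r_2, \epsilon_2)]) \right), \]
\[ \frac{\partial G(S^*(r_2(p_1, r_1)) - (\beta_1 + \beta_2) \delta, P_2^*(r_2(p_1 + \delta, r_1)), r_2)}{\partial p_1} = (\beta_1 + \beta_2) \left( b - (h + b) F(S^*(r_2(p_1, r_1)) - (\beta_1 + \beta_2) \delta - E[D(P_2^*, r_2, \epsilon_2)] - \tilde{O}(\delta)) \right). \]

By the continuity of $F(\cdot)$ it follows directly that

\[ \lim_{\delta \to 0} \left[ \frac{\partial G(S^*(r_2(p_1 - \delta, r_1)), P_2^*(r_2(p_1 - \delta, r_1)), r_2)}{\partial p_1} - \frac{\partial G(S^*(r_2(p_1, r_1)) - (\beta_1 + \beta_2) \delta, P_2^*(r_2(p_1 + \delta, r_1)), r_2)}{\partial p_1} \right] \]

equals zero and thus

\[ \lim_{\delta \to 0} \left[ \frac{\partial V_2^*(r_2(p_1 - \delta, r_1))}{\partial p_1} - \frac{\partial \vartheta_2^*(x_2(y_1, p_1 + \delta, r_1, \epsilon_1), r_2(p_1 + \delta, r_1))}{\partial p_1} \right] = 0, \]

which proves equation (A.20).

Similarly to the proof above, we now replace $y_1$ by $y_1 - \delta$ in order to approach the kink from
the left side with respect to $y_1$. $x_2(y_1 - \delta, p_1, r_1, \epsilon_1)$ becomes

\[ x_2 = y_1 - \delta - E[D(p_1, r_1, \epsilon_1)] - \epsilon_1 \]
\[ = y_1 - \delta - E[D(p_1, r_1, \epsilon_1)] - y_1 + E[D(p_1, r_1, \epsilon_1)] + S^*(r_2(p_1, r_1)) \]
\[ = S^*(r_2(p_1, r_1)) - \delta \le S^*(r_2(p_1, r_1)). \]

Thus $v_2(x_2(y_1 - \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) = V_2^*(r_2(p_1, r_1))$.

Substituting $y_1$ by $y_1 + \delta$ lets us approach the kink from the right side with respect to $y_1$.
Similarly to above $x_2(y_1 + \delta, p_1, r_1, \epsilon_1)$ becomes

\[ x_2 = S^*(r_2(p_1, r_1)) + \delta \ge S^*(r_2(p_1, r_1)) \]

and thus

\[ v_2(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) = \vartheta_2^*(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)). \]

We are now ready to prove equation (A.19), which by the above results becomes

\[ \lim_{\delta \to 0} \left[ V_2^*(r_2(p_1, r_1)) - \vartheta_2^*(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) \right]. \]

Analogously to above we obtain

\[ V_2^*(r_2(p_1, r_1)) = E[\Pi(S^*(r_2(p_1, r_1)) - \delta, S^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] \]
\[ = P_2^*(r_2(p_1, r_1)) E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] - c \delta - G(S^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2) \]

and for $x_2^{y_1 + \delta} = x_2(y_1 + \delta, p_1, r_1, \epsilon_1)$

\[ \vartheta_2^*(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) = E[\Pi(S^*(r_2(p_1, r_1)) + \delta, S^*(r_2(p_1, r_1)) + \delta, p_2^*(x_2^{y_1 + \delta}, r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] \]
\[ = P_2^*(r_2(p_1, r_1)) E[D(P_2^*(r_2(p_1, r_1)), r_2(p_1, r_1), \epsilon_2)] - G(S^*(r_2(p_1, r_1)) + \delta, P_2^*(r_2(p_1, r_1)), r_2), \]

since $p_2^*(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) = P_2^*(r_2(p_1, r_1))$ by lemma A.4.

From the above two equations and by the fact that $G(y, p, r)$ is a continuous function it is
again easy to see that

\[ \lim_{\delta \to 0} \left[ V_2^*(r_2(p_1, r_1)) - \vartheta_2^*(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1)) \right] = 0, \]

which proves equation (A.19). From equation (5.1.9) we see that for $P_2^* = P_2^*(r_2(p_1, r_1))$
the following holds:

\[ \frac{\partial G(S^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2)}{\partial y_1} = -b + (h + b) F(S^*(r_2(p_1, r_1)) - E[D(P_2^*, r_2, \epsilon_2)]), \]
\[ \frac{\partial G(S^*(r_2(p_1, r_1)) + \delta, P_2^*(r_2(p_1, r_1)), r_2)}{\partial y_1} = -b + (h + b) F(S^*(r_2(p_1, r_1)) + \delta - E[D(P_2^*, r_2, \epsilon_2)]). \]

By the continuity of $F(\cdot)$ it follows directly that

\[ \lim_{\delta \to 0} \left[ \frac{\partial G(S^*(r_2(p_1, r_1)), P_2^*(r_2(p_1, r_1)), r_2)}{\partial y_1} - \frac{\partial G(S^*(r_2(p_1, r_1)) + \delta, P_2^*(r_2(p_1, r_1)), r_2)}{\partial y_1} \right] \]

equals zero and thus

\[ \lim_{\delta \to 0} \left[ \frac{\partial V_2^*(r_2(p_1, r_1))}{\partial y_1} - \frac{\partial \vartheta_2^*(x_2(y_1 + \delta, p_1, r_1, \epsilon_1), r_2(p_1, r_1))}{\partial y_1} \right] = 0, \]

which proves equation (A.21).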


Lisa Gimpl-Heersink - 978-3-631-75380-4 
Downloaded from PubFactory at 01/11/2019 05:41:29AM 
via free access 


Bibliography 


Arrow, Kenneth J., Theodore Harris, Jacob Marschak. 1951. Optimal inventory policy. Econometrica 
19(3) 250-272. 


Bellman, R., I. Glicksberg, O. Gross. 1955. On the optimal inventory equation. Management
Science 2(1) 83-104.


Bertsekas, Dimitri P. 2001. Dynamic Programming and Optimal Control, Vol. II. 2nd ed. Athena 
Scientific. 


Bertsekas, Dimitri P. 2005. Dynamic Programming and Optimal Control, Vol. I. 3rd ed. Athena 
Scientific. 


Bijmolt, Tammo, Harald Van Heerde, Rik Pieters. 2005. New empirical generalizations on the deter- 
minants of price elasticity. Journal of Marketing Research 42(2) 141-156. 


Briesch, Richard A., Lakshman Krishnamurthi, Tridib Mazumdar, S. P. Raj. 1997. A comparative 
analysis of reference price models. Journal of Consumer Research 24 202-214. 


Chan, L. M. A., Z. J. Max Shen, David Simchi-Levi, Julie L. Swann. 2004. Coordination of pricing 
and inventory decisions: A survey and classification. David Simchi-Levi, S. David Wu, Zuo-Jun 
Shen, eds., Handbook of Quantitative Supply Chain Analysis: Modeling in the E-Business Era,
chap. 9. Kluwer Academic Publishers, 335-392. 


Chen, Xin, David Simchi-Levi. 2004a. Coordinating inventory control and pricing strategies with 
random demand and fixed ordering cost: The finite horizon case. Operations Research 52 887- 
896. 


Chen, Xin, David Simchi-Levi. 2004b. Coordinating inventory control and pricing strategies with 
random demand and fixed ordering cost: The infinite horizon case. Mathematics of Operations 
Research 29 698-723.


Eliashberg, Jehoshua, R. Steinberg. 1993. Marketing-production joint decision-making. Jehoshua 
Eliashberg, G. L. Lilien, eds., Marketing, Handbooks in Operations Research and Management 
Science, vol. 5, chap. 18. North Holland, Amsterdam, 827-880. 


Elmaghraby, Wedad, Pinar Keskinocak. 2003. Dynamic pricing in the presence of inventory con- 
siderations: Research overview, current practices, and future directions. Management Science 
49(10) 1287-1309. 

Federgruen, Awi, Aliza Heching. 1999. Combined pricing and inventory control under uncertainty. 
Operations Research 47(3) 454-475.


Fibich, Gadi, Arieh Gavious, Oded Lowengart. 2003. Explicit solutions of optimization models and 
differential games with nonsmooth (asymmetric) reference-price effects. Operations Research 
51 721-734.


Gallego, Guillermo, Garrett Van Ryzin. 1994. Optimal dynamic pricing of inventories with stochastic 
demand over finite horizons. Management Science 40(8) 999-1020.


Greenleaf, Eric. 1995. The impact of reference price effects on the profitability of price promotions. 
Marketing Science 14(1) 82-104. 


Heuser, H. 1981. Lehrbuch der Analysis. B.G. Teubner, Stuttgart.


Heyman, Daniel P., Matthew J. Sobel. 2004. Stochastic Models in Operations Research, Vol. II:
Stochastic Processes and Operating Characteristics. Dover Publications. 


Ho, Teck-Hua, Juanjuan Zhang. 2004. Does format of pricing contract matter? Working paper no. 
x105-002, University of California, Berkeley. 

Iglehart, Donald L. 1963. Optimality of (s, S) policies in the infinite horizon dynamic inventory 
problem. Management Science 9 259-267. 

Jung, June Young, Gary Blau, Joseph F. Pekny, Gintaras V. Reklaitis, David Eversdyk. 2004. A 
simulation based optimization approach to supply chain management under demand uncertainty. 
Computers and Chemical Engineering 28 2087-2106. 


Khouja, Moutaz. 1996. A note on the newsboy problem with an emergency supply option. The 
Journal of the Operational Research Society 47(12) 1530-1534. 


Kopalle, Praveen K., Joan Lindsey-Mullikin. 2003. The impact of external reference price on
consumer price expectations. Journal of Retailing 79(4) 225-237.


Kopalle, Praveen K., Ambar G. Rao, Joao L. Assuncao. 1996. Asymmetric reference price effects 
and dynamic pricing policies. Marketing Science 15(1) 60-85. 

Krishnamurthi, Lakshman, Tridib Mazumdar, S. P. Raj. 1992. Asymmetric response to price in
consumer brand choice and purchase quantity decisions. Journal of Consumer Research 19 387-400.

Lee, Hau L., Steven Nahmias. 1993. Single-product, single-location models. S.C. Graves,
A.H.G. Rinnooy Kan, P.H. Zipkin, eds., Logistics of Production and Inventory, Handbooks 
in Operations Research and Management Science, vol. 4, chap. 1. North-Holland, Amsterdam, 
3-55. 

Mazumdar, Tridib, S.P. Raj, Indrajit Sinha. 2005. Reference price research: Review and propositions. 
Journal of Marketing 69 84-102.

Miranda, Mario J., Paul L. Fackler. 2002. Applied Computational Economics and Finance. MIT 
Press. 

Moon, Sangkil, Gary J. Russell, Sri Devi Duvvuri. 2006. Profiling the reference price consumer. 
Journal of Retailing 82(1) 1-11. 

Natter, Martin, Thomas Reutterer, Andreas Mild, Alfred Taudes. 2006. An assortment-wide
decision-support system for dynamic pricing and promotion planning in DIY retailing. Marketing
Science Forthcoming.

Nerlove, Marc. 1985. Adaptive expectations and cobweb phenomena. The Quarterly Journal of 
Economics 72(2) 227-240. 

Petruzzi, Nicholas C., Maqbool Dada. 1999. Pricing and the newsvendor problem: A review with 
extensions. Operations Research 47(2) 183-194.


Phillips, Robert L. 2005. Pricing and Revenue Optimization. Stanford University Press.

Popescu, Ioana, Yaozhong Wu. 2007. Dynamic pricing strategies with reference effects. Operations
Research.


Porteus, E.L. 1990. Stochastic inventory theory. D.P. Heyman, M.J. Sobel, eds., Stochastic Models, 
Handbooks in operations research and management science, vol. 2, chap. 12. North-Holland, 
Amsterdam, 605-652. 

Porteus, Evan. 1982. Conditions for characterizing the structure of optimal strategies in
infinite-horizon dynamic programs. Journal of Optimization Theory and Applications 36(3) 419-432.

Porteus, Evan L. 2002. Foundations of Stochastic Inventory Theory. Stanford University Press. 

Puterman, M. L. 1994. Markov decision processes: discrete stochastic dynamic programming. John 
Wiley, New York. 

Scarf, Herbert. 1960. The optimality of (s,S) policies for the dynamic inventory problem. K. J. Arrow, 
S. Karlin, P. Suppes, eds., Mathematical Methods in the Social Sciences. Stanford University 
Press, Stanford, California, 196-202. 

Simchi-Levi, David, Xin Chen, Julien Bramel. 2005. The Logic of Logistics: theory, algorithms, and 
applications for logistics and supply chain management. 2nd ed. Springer series. 

Slonim, Robert, Ellen Garbarino. 2002. Similarities and differences between stockpiling and
reference effects. Working paper, Case Western Reserve University.

Smith, James E., Kevin M. McCardle. 2002. Structural properties of stochastic dynamic programs. 
Operations Research 50(5) 796-809. 

Stokey, Nancy L., Robert E. Jr. Lucas, Edward C. Prescott. 1989. Recursive Methods in Economic 
Dynamics. 

Talluri, Kalyan T., Garrett J. van Ryzin. 2004. The Theory and Practice of Revenue Management. 
Kluwer Academic Publishers. 

Tellis, Gerard. 1988. The price elasticity of selective demand: A meta-analysis of economic models 
of sales. Journal of Marketing Research 25(4) 331-341. 

Thomas, Joseph. 1970. Price-production decisions with deterministic demand. Management Science
16(11) 747-750.

Thomas, L. Joseph. 1974. Price and production decisions with random demand. Operations Research 
22 513-518. 

Topkis, Donald M. 1998. Supermodularity and Complementarity. 10th ed. Princeton University
Press, Princeton, New Jersey, US.

Veinott, Arthur F. 1965. Optimal policy for a multi-product, dynamic, nonstationary inventory
problem. Management Science 12(3) 206-222.

Veinott, Arthur F. 1966. On the optimality of (s,S) inventory policies: New conditions and a new 
proof. SIAM Journal on Applied Mathematics 14(5) 1067-1083. 

Wagner, Harvey M., Thomson M. Whitin. 1958. Dynamic version of the economic lot size model. 
Management Science 5 89-96.

Whitin, T. M. 1955. Inventory control and price theory. Management Science 2(1) 61-68. 

Winer, Russell S. 1986. A reference price model of brand choice for frequently purchased products.
Journal of Consumer Research 13 250-256.

Zheng, Yu-Sheng. 1991. A simple proof for optimality of (s, S) policies in infinite-horizon inventory 
systems. Journal of Applied Probability 28(4) 802-810.

Zipkin, Paul Herbert. 2000. Foundations of Inventory Management. Shelstad, Jeffrey J. 

Forschungsergebnisse der Wirtschaftsuniversität Wien

Editor: Wirtschaftsuniversität Wien, represented by a.o. Univ. Prof. Dr. Barbara Sporn

Vol. 1 Stefan Felder: Frequenzallokation in der Telekommunikation. Ökonomische Analyse der
Vergabe von Frequenzen unter besonderer Berücksichtigung der UMTS-Auktionen. 2004.

Vol. 2 Thomas Haller: Marketing im liberalisierten Strommarkt. Kommunikation und
Produktplanung im Privatkundenmarkt. 2005.

Vol. 3 Alexander Stremitzer: Agency Theory: Methodology, Analysis. A Structured Approach to
Writing Contracts. 2005.

Vol. 4 Günther Sedlacek: Analyse der Studiendauer und des Studienabbruch-Risikos. Unter
Verwendung der statistischen Methoden der Ereignisanalyse. 2004.

Vol. 5 Monika Knassmüller: Unternehmensleitbilder im Vergleich. Sinn- und Bedeutungsrahmen
deutschsprachiger Unternehmensleitbilder - Versuch einer empirischen (Re-)Konstruktion. 2005.

Vol. 6 Matthias Fink: Erfolgsfaktor Selbstverpflichtung bei vertrauensbasierten Kooperationen.
Mit einem empirischen Befund. 2005.

Vol. 7 Michael Gerhard Kraft: Ökonomie zwischen Wissenschaft und Ethik. Eine
dogmenhistorische Untersuchung von Léon M.E. Walras bis Milton Friedman. 2005.

Vol. 8 Ingrid Zechmeister: Mental Health Care Financing in the Process of Change. Challenges
and Approaches for Austria. 2005.

Vol. 9 Sarah Meisenberger: Strukturierte Organisationen und Wissen. 2005.

Vol. 10 Anne-Katrin Neyer: Multinational teams in the European Commission and the European
Parliament. 2005.

Vol. 11 Birgit Trukeschitz: Im Dienst Sozialer Dienste. Ökonomische Analyse der Beschäftigung in
sozialen Dienstleistungseinrichtungen des Nonprofit Sektors. 2006.

Vol. 12 Marcus Kölling: Interkulturelles Wissensmanagement. Deutschland Ost und West. 2006.

Vol. 13 Ulrich Berger: The Economics of Two-way Interconnection. 2006.

Vol. 14 Susanne Guth: Interoperability of DRM Systems. Exchanging and Processing XML-based
Rights Expressions. 2006.

Vol. 15 Bernhard Klement: Ökonomische Kriterien und Anreizmechanismen für eine effiziente
Förderung von industrieller Forschung und Innovation. Mit einer empirischen Quantifizierung
der Hebeleffekte von F&E-Förderinstrumenten in Österreich. 2006.

Vol. 16 Markus Imgrund: Wege aus der Insolvenz. Eine Analyse der Fortführung und Sanierung
insolventer Klein- und Mittelbetriebe unter besonderer Berücksichtigung des
Konfigurationsansatzes. 2007.

Vol. 17 Nicolas Knotzer: Product Recommendations in E-Commerce Retailing Applications. 2008.

Vol. 18 Astrid Dickinger: Perceived Quality of Mobile Services. A Segment-Specific Analysis.
2007.

Vol. 19 Nadine Wiedermann-Ondrej: Hybride Finanzierungsinstrumente in der nationalen und
internationalen Besteuerung der USA. 2008.

Vol. 20 Helmut Sorger: Entscheidungsorientiertes Risikomanagement in der
Industrieunternehmung. 2008.

Vol. 21 Martin Rietsch: Messung und Analyse des ökonomischen Wechselkursrisikos aus
Unternehmenssicht: Ein stochastischer Simulationsansatz. 2008.


Vol. 22 Hans Christian Mantler: Makroökonomische Effizienz des Finanzsektors. Herleitung eines
theoretischen Modells und Schätzung der Wachstumsimplikationen für die Marktwirtschaften
und Transformationsökonomien Europas. 2008.

Vol. 23 Youri Tacoun: La théorie de la valeur de Christian von Ehrenfels. 2008.

Vol. 24 Monika Koller: Longitudinale Betrachtung der Kognitiven Dissonanz. Eine Tagebuchstudie
zur Reiseentscheidung. 2008.

Vol. 25 Marcus Scheiblecker: The Austrian Business Cycle in the European Context. 2008.

Vol. 26 Aida Numic: Multinational Teams in European and American Companies. 2008.

Vol. 27 Ulrike Bauernfeind: User Satisfaction with Personalised Internet Applications. 2008.

Vol. 28 Reinhold Schodl: Systematische Analyse und Bewertung komplexer Supply Chain
Prozesse bei dynamischer Festlegung des Auftragsentkopplungspunkts. 2008.

Vol. 29 Bianca Gusenbauer: Öffentlich-private Finanzierung von Infrastruktur in
Entwicklungsländern und deren Beitrag zur Armutsreduktion. Fallstudien in Vietnam und auf den
Philippinen. 2009.

Vol. 30 Elisabeth Salomon: Hybrides Management in sino-österreichischen Joint Ventures in
China aus österreichischer Perspektive. 2009.

Vol. 31 Katharina Mader: Gender Budgeting: Ein emanzipatorisches, finanzpolitisches und
demokratiepolitisches Instrument. 2009.

Vol. 32 Michael Weber: Die Generierung von Empfehlungen für zwischenbetriebliche
Transaktionen als gesamtwirtschaftliche Infrastrukturleistung. 2009.

Vol. 33 Lisa Gimpl-Heersink: Joint Pricing and Inventory Control under Reference Price Effects.
2009.


www.peterlang.de 

